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Kinase sequences useful for developing compounds for the prevention and/or treatment 
of metabolic diseases and nucleotide sequences encoding such kinase sequences. 

The present invention relates to nucleotide sequences that are useful in the 
pharmaceutical field. 

In particular, the invention relates to nucleotide sequences that encode and/or may 
be used to express amino acid sequences that are useful in the identification and/or 
development of compounds with (potential) activity as pharmaceuticals, in particular of 
compounds for the prevention and/or treatment of metabolic diseases such as diabetes 
and obesity. These nucleotide sequences, which will be further described below, will also 
be referred to herein as "nucleotide sequences of the invention". 

The invention also relates to the amino acid sequences - such as proteins and/or 
polypeptides - that are encoded by, and/or that may be obtained by suitable expression of, 
the nucleotide sequences of the invention. These amino acid sequences, which will be 



further described below, will also be referred to herein as "amino acid sequences of the 
invention". 

The invention also relates to the use of the nucleotide sequences of the invention, 
preferably in the form of a suitable genetic construct as described below, in the 
transformation of host cells and/or host organisms, for example for the expression of the 
amino acid sequences of the invention. The invention also relates to host cells and/or host 
organisms that have been transformed with the nucleotide sequences of the invention 
and/or that can express the amino acid sequences of the invention. 

The invention farther relates to methods for the identification and/or development 
of compounds that can modulate the (biological) activity of the amino acid sequences of 
the invention, in which the abovementioned nucleotide sequences, amino acid sequences, 
genetic constructs, host cells and/or host organisms are used. Such methods which will 
usually be in the form of an assay or screen, as will also be further described below. 

The invention also relates to the use of the nucleotide sequences, amino acid 
sequences, genetic constructs, host cells and/or host organisms of the invention in 
(methods for) identifying and/or developing compounds that can modulate the 
(biological) activity of the amino acid sequences of the invention. 



WO 2004/056982 PCT/EP2003/0 14674 

2 

Also, the invention relates to compounds that can modulate the (biological) 
activity of the amino acid sequences of the invention, to compositions that contain such 
compounds, and to the use of such compounds in the preparation of such compositions. 

In particular, the invention relates to such compositions that are in the form of 
5 pharmaceutical compositions, and more in particular in the form of pharmaceutical 
compositions for the prevention or treatment of metabolic diseases such as diabetes or 
obesity, and also to the use of compounds that can modulate the (biological) activity of 
the amino acid sequences of the invention in the preparation of such pharmaceutical 
compositions. 

10 The invention further relates to the use of the nucleotide sequences, amino acid 

sequences, genetic constructs, host cells and/or host organisms of the invention in 
(methods for) identifying and/or developing compounds that can be used in the 
prevention or treatment of metabolic diseases. 

Other aspects, embodiments, applications and advantages of the present invention 
15 will become clear from the further description below. 

The present invention was established from the finding that the amino acid 
sequences of the invention are involved in metabolic processes (as further described 
below) and thus can be used as (potential) "target(s)" for in vitro and/or in vivo 
interaction with chemical compounds and other factors (with the term "target" having its 
20 usual meaning in the art, vide for example the definition given in WO 98/06737), and 
also from the finding that the nucleic acid sequence and amino acid sequences of the 
invention are involved in metabolic diseases. Consequently, compounds and/or factors 
that have been identified as interacting with the amino acid sequences of the invention 
(e.g. by the methods as described hereinbelow) may be useful as active agents in the 
25 pharmaceutical field, and in particular for the prevention and treatment of metabolic 
diseases. All this is supported by the following experimental data/observations: 

In an experimental model for fat handling, metabolism and storage using the model 
organism C. elegans (which model is further described in the Examples), 
downregulation of the C elegans gene T17E9.1a by RNA interference strongly 
30 reduces the fat storage phenotype in said nematode. As further described below, 

T17E9.1a is the C. elegans ortholog of STE20 like kinase in mammals. 
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Some particularly preferred examples of nucleotide sequences of the invention are: 

the nucleotide sequence of SEQ ID NO:l (T17E9.1a, also known as kin- 18), which 
is a sequence derived from the nematode worm C. elegans; and 
the human orthologs of said C.elegans sequence, as may be identified by 
bioinformatic comparison of the C.elegans sequence with the human genome. Some 
preferred, but non-limiting orthologues are given in SEQ ID NOS: 3, 5, 7 and 9. 
Generally herein, the use of the human nucleotide sequences of SEQ ID NOS: 3, 

5, 7 and 9 and/or the use of nucleotide sequence derived therefrom will be preferred, in 

particular when the invention is used to develop compounds for pharmaceutical use. 
In a broader sense, the term "nucleotide sequence of the invention" also 

comprises: 

parts and/or fragments of any of the nucleotide sequence of SEQ ID NO'S: 1, 3, 5, 7 
and 9; 

(natural and/or synthetic) mutants, variants, alleles, analogs, orthologs (hereinbelow 
collectively referred to as "mutants") of any of the nucleotide sequence of SEQ ID 
NO'S: 1-5, as further described below. 

parts and/or fragments of such (natural or synthetic) mutants; 

nucleotide fusions of any of the nucleotide sequence of SEQ ID NO'S: 1-5 (or a part 

or fragment thereof) with at least one further nucleotide sequence; 

nucleotide fusions of (natural or synthetic) mutants (or a part or fragment thereof) 

with at least one further nucleotide sequence; 
in which such mutants, parts, fragments and/or fusions are preferably as further described 
below. 

The invention also comprises different splice variants of the above nucleotide 
sequences. 

Some particularly preferred examples of amino acid sequences of the invention 

are: 

the amino acid sequence of SEQ ID NO:2, which is a sequence derived from the 
nematode worm C elegans; and 
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the human analogs of said C.elegans sequence, as may be identified by bioinformatic 
comparison of the C.elegans sequence with the human genome. Some preferred, but 
non-limiting analogs are given in SEQ ID NOS: 4, 6, 8 and 10. 

In a broader sense, the term "amino acid sequence of the invention" also 
5 comprises: 

parts and/or fragments of the amino acid sequence of SEQ ID NOS: 2, 4, 6, 8 and 1 0. 

(natural and/or synthetic) mutants, variants, alleles, analogs, orthologs (hereinbelow 

collectively referred to as "analogs' 9 ) of the amino acid sequence of SEQ ID NOS: 2, 

4, 6, 8 and 10; 
1 0 - parts and/or fragments of such analogs; 

fusions of the amino acid sequence of SEQ ID NOS: 2, 4, 6, 8 and 10 (or a part or 

fragment thereof) with at least one further amino acid residue or sequence; 

fusions of the amino acid sequence of an analog (or a part or fragment thereof) with 

at least one further amino acid residue or sequence; 
15 in which such mutants, parts, fragments and/or fusions are preferably as further described 
below. 

The term "amino acid sequence of the invention" also comprises "immature" 
forms of the abovementioned amino acid sequences, such as a pre-, pro- or prepro- forms 
and/or fusions with suitable leader sequences. Also, the amino acid sequences of the 
20 invention may have been subjected to post-translational processing and/or be suitably 

glycosylated, depending upon the host cell or host organism used to express/produce said 
amino acid sequence; and/or may be otherwise modified (e.g. by chemical techniques 
known per se in the art). 

Generally herein, the use of the human amino acid sequence of SEQ ID NOS: 4, 
25 6, 8 and 10 and/or of amino acid sequences derived therefrom will be preferred, in 
particular when the invention is used to develop compounds for pharmaceutical use. 

From the prior art, the following is known about the nucleotide sequences, amino 
acid sequences and compounds of the invention: 

(1) SEQ ID NOS: 1 and 2. These are the nucleotide sequence and amino acid sequence, 
30 respectively, of the C. elegans gene/protein Tl 7E9. la (also known as "kin- 18") 

Reference is made to the following entry from the WORMBASE database: 
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for protein sequence: 

http://www.wormbase.orp/db/seq/protein?name=WP%3ACE014QS:class=Protein 
for DNA sequence: 

http://www.won7ibase.org/db/seq/sequence?name=Tl 7E9. 1 a;c!ass=Sequence 
as well as to Berman et al., Gene 279 (2001) 137-147. 

(2) SEQ ID NOS: 3 and 4. These are the nucleotide sequence and amino acid sequence, 
respectively, of the human gene/protein known as "JNK/SAPK inhibitory kinase' 1 
(also known as "JIK", "DPK" or "KDS", or STE20-like kinase). 

Reference is made to the following entries from the GENBANK database: 

- NMJ)16281(mRNA); and 

- NP_057365.2 (translation). 
See also: 

GENBANK entry AF1 35 1 58; Zhang et al., Biochem. Biophys. Res. Commun. 
274 (3), 872-879 (2000); MEDLINE 20384190, PUBMED 10924369; and 

- GENBANK entry AF179867; Tassi et al., J. Biol. Chem. 274 (47), 33287-33295 

(1999) ; MEDLINE 20026851; PUBMED 10559204 

(3) SEQ ID NOS: 5 and 6. These are the nucleotide sequence and amino acid sequence, 
respectively, of the human gene/protein known as "prostate derived STE20-like 
kinase" (also known as "PSK" or "TA02"). 

Reference is made to the following entries from the GENBANK database: 

- NM_016151.1 (mRNA) 

- NP_057235.1 (translation) 
See also: 

- GENBANK entry AF061943; Moore et al, J, Biol. Chem. 275 (6), 431 1-4322 

(2000) ; MEDLINE 20127920; PUBMED 10660600; 

- Chen et al., J. Biol. Chem., Vol. 274, No. 40, pp. 288803-28808 (1999). 

(4) SEQ ID NOS: 7 and 8. These are the nucleotide sequence and amino acid sequence, 
respectively, of the human gene/protein known as "thousand and one amino acid 
protein kinase" (also known as "TAOJ"). 

Reference is made to the following entries from the GENBANK database: 

- NM_004783 (mRNA) 
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- NP_004774.1 (tranlation) 
See also: 

- GENBANK entry AB020688; Nagase et al, DNA Res. 5 (6), 355-364 (1998); 
MEDLINE 99156230; PUBMED 10048485 

5 (5) SEQ ID NOS: 9 and 10. These are the nucleotide sequence and amino acid sequence, 
respectively, of the human gene/protein which with designation K1AA1361 (note that 
although this gene and protein are also known as "thousand and one amino acid 
protein kinase" or "TAOV\ KIAA1361 is a different gene/protein from SEQ ID NOS 
7 and 8). 

J 0 Reference is made to the following entries from the EMBL and SpTrEMBL databases 

- AY049015(EMBL) 

- Q9P2I6 (SPTREMBL) 

See also: Nagase et al., DNA Res. 7:65 (2000) and EMBL entry NR AB037782 
(http://www.ebi.ac.uk/cgi-bin/emblfetch7AB037782) 
15 Also, although the inventors do not wish to be limited thereto, it is believed that 

SEQ ID NOS: 5/6 on the one hand, and SEQ ID NOS: 7 and 8 on the other hand, are 
transcripts from the same gene/locus. 

Thus, in a first aspect, the invention relates to a nucleic acid, preferably in 
(essentially) isolated form, which nucleic acid encodes and/or can be used to express an 
20 amino acid sequence of the invention (as defined herein), and in particular the amino acid 
sequence of SEQ ID NOS: 2, 4, 6, 8 or 10. 

In another aspect, the invention relates to a nucleic acid, preferably in (essentially) 
isolated form, which nucleic acid comprises a nucleotide sequence of the invention, and 
in particular the nucleotide sequence of SEQ ID NOS: 1 , 3, 5, 7 or 9. 
25 In a yet another aspect, the invention relates to a nucleic acid, preferably in 

(essentially) isolated form, which nucleic acid essentially consists of a nucleotide 
sequence of the invention, and in particular a nucleotide sequence SEQ ID NO: 1, 3, 5, 7 
or 9. 

Collectively, these nucleic acids will also be referred to herein as "nucleic acids of 
30 the invention". Also, where appropriate in the context of the further description of the 
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invention below, the terms "nucleotide sequence of the invention" and "nucleic acid of 
the invention" may be considered essentially equivalent and essentially interchangeable. 

Also, for the purposes of the present invention, a nucleic acid or amino acid 
sequence is considered to "(in) essentially isolated (form)" - for example, from its native 
5 biological source - when it has been separated from at least one other component (and in 
particular macromolecule) with which it is usually associated, such as another nucleic 
acid, another protein/polypeptide or another (polymeric) biological component. In 
particular, a nucleic acid or amino acid sequence is considered "essentially isolated" 
when it has been purified at least 2-fold, in particular at least 10- fold, more in particular 
10 at least 100-fold, and up to 1000- fold or more. 

The nucleic acids of the invention may also be in the form of a genetic construct, 
again as further described below. These constructs will also be referred to herein as 
"genetic constructs of the invention". In a preferred embodiment, such a construct will 
comprise: 

1 5 a) the nucleotide sequence of the invention; operably connected to: 

b) one or more regulatory elements, such as a promoter and optionally a suitable 
terminator; 

and optionally also: 

c) one or more further elements of genetic constructs known per se; 

20 in which the terms "regulatory element", "promoter", "terminator", "further elements" 
and "operably connected" have the meanings indicated hereinbelow. 

In another aspect, the invention relates to a protein or polypeptide, preferably in 
(essentially) isolated form, said protein or polypeptide comprising an amino acid 
sequence of the invention (as defined above), and in particular the amino acid sequence 

25 of SEQ ID NOS: 2, 4, 6, 8 or 1 0. 

In a further aspect, the invention relates to a protein or polypeptide, preferably in 
(essentially) isolated form, said protein or polypeptide essentially consisting of an amino 
acid sequence of the invention (as defined above), and in particular of the amino acid 
sequence of SEQ ID NOS: 2, 4, 6, 8 or 10. 

30 In a further aspect, the invention relates to methods for transforming a host cell 

and/or a host organism with a nucleotide sequence, with a nucleic acid and/or with a 
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genetic construct of the invention. The invention also relates to the use of a nucleotide 
sequence, of a nucleic acid and/or of a genetic construct of the invention transforming a 
host cell or a host organism. 

In yet another aspect, the invention relates to a host cell or host organism that has 
been transformed and/or contains with a nucleotide sequence, with a nucleic acid and/or 
with a genetic construct of the invention. The invention also relates to a host cell and/or 
host organism that expresses, or (at least) is capable of expressing (e.g. under suitable 
conditions), an amino acid sequence of the invention. Collectively, such host cells/host 
organisms will also be referred to herein as "host cells/host organisms of the invention". 

In yet another aspect, the invention relates to methods for producing an amino 
acid sequence of the invention, in which a nucleotide sequence, nucleic acid, genetic 
construct, host cell or host organism of the invention is used. Such methods may for 
instance include expressing a nucleotide sequence of the invention in a suitable host cell 
or host organism (e.g. upon suitable transformation), and/or maintaining and/or 
cultivating a host cell or host organism of the invention under suitable conditions, i.e. 
such that an amino acid sequence of the invention is expressed or obtained. Optionally, 
these methods may also comprise (one or more steps for) isolating the amino acid 
sequence thus expressed/produced. The invention also relates to the use of a nucleotide 
sequence, a nucleic acid, a genetic construct and/or a host cell/host organism of the 
invention in such a method. 

In yet a further aspect, the invention relates to a method for identifying a 
compound that can modulate the (biological) activity of, and/or that can otherwise 
interact with, an amino acid sequence of the invention, which method is as further 
described below. The invention also relates to the use of a nucleotide sequence, a nucleic 
acid, a genetic construct, an amino acid sequence and/or a host cell/host organism of the 
invention in such a method. 

In yet a further aspect, the invention relates to a method for identifying a 
compound that can be used in (the preparation of a pharmaceutical composition for) the 
prevention and/or treatment of metabolic diseases (as further defined below), which 
method is as further described below. The invention also relates to the use of a nucleotide 
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sequence, a nucleic acid, a genetic construct, an amino acid sequence and/or a host 
cell/host organism of the invention in such a method. 

The invention also relates to compounds that can modulate the (biological activity 
of), and/or that can otherwise interact with, an amino acid sequence of the invention, 
5 either in vitro or preferably (also) in vivo, as further described below. The invention also 
relates to compositions that contain such compounds, and in particular to pharmaceutical 
compositions that contains such compounds. 

The invention further relates to the use of compounds that can modulate the 
(biological activity of), and/or that can otherwise interact with, an amino acid sequence of 
10 the invention in the preparation of these compositions, and in particular to the use of such 
compounds in the preparation of a pharmaceutical composition for the prevention and/or 
treatment of metabolic diseases. 

The invention also relates to compounds that can be used in the prevention and/or 
treatment of metabolic diseases (as further defined below), which compounds have or can 
15 be identified and/or developed using the method, nucleic acid sequence, amino acid 

sequence and/or host cell or host organism of the invention. The invention also relates to 
compositions that contain such compounds, and in particular to pharmaceutical 
compositions that contain said compounds. 

The invention also relates to the use of such compounds in the preparation of a 
20 pharmaceutical composition, and in particular to the use of such compounds in the 

preparation of a pharmaceutical composition for the prevention or treatment of metabolic 
diseases. 

Unless explicitly specified herein, all terms used in the present description have 
their usual meaning in the art, for which particular reference is made to the definitions 
25 given in WO98/06737 and EP 1 085 089. 

The nucleotide sequences and amino acid sequences of the invention may 
generally further be characterized by the presence of a kinase domain (identified using 
SMART™-analysis), as follows: 

30 - SEQ ID NO:2: from amino acid (a.a.) residue 30 to a.a. residue 289 
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SEQ ID NO:4: 
SEQ ID NO:6: 
SEQ ID NO: 8: 
SEQ ID NO: 10: 



from a.a. residue 24 to a.a. residue 277 



from a.a. residue 28 to a.a. residue 281 



from a.a. residue 28 to a.a. residue 281 



from a.a. residue 32 to a.a. residue 285 



10 



15 



20 



25 



On the basis of the above and on the basis of the prior art referred to above, and 
although the invention is not specifically limited to any specific explanation or 
mechanism, it is assumed that the nucleotide sequences and/or amino acid sequences 
have (biological) activity as kinases. 

In particular, and although the invention is again not limited to any specific 
explanation or hypothesis, it is assumed that the amino acid sequences of the invention 
may be involved in (the modulation of) the JNK/SAPK pathway (vide for example Zhang 
et al. and Tassi et al., both supra; as well as Yoneda et al., J. Biol. Chem., Vol. 276, 
No.17, p. 13935-13940 (2001)). 

For JNKs and their role in diabetes/obesity, reference is inter alia made to 
Hirosumi et al., Nature. Vol. 20, November 2002, 333-336. 

As is known in the art, biological activity of this kind can be measured using 
standard assay techniques for kinases, which are well known to the skilled person. Some 
preferred, but non-limiting examples include: 

the JNK/SAPK assay described by Zhang et al., supra, which is particularly suitable 
for the kinase of SEQ ID NO 4 or a similar kinase; 

the kinase assays described by Moore et al., supra, in particular the in vitro kinase 
assays described on page 431 5; 
The nucleotide sequence of SEQ ID NO: 1 was identified, and can be 
derived/isolated from/using the nematode C.elegans; in the manner as further described 
in Berman et al., or in any other suitable manner known per se. 

The nucleotide sequences of SEQ ID NOS: 3, 5, 7 and 9 were identified, and can 
be derived/isolated from/using human cells; in the manner as further described in the 
prior art referred to above, or in any other suitable manner known per se. 
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Also, it is expected that - based upon the disclosure herein - the skilled person will 
be able to identify, derive and/or isolate natural "mutants" (as mentioned above) of the 
nucleotide sequence of SEQ ID NOS: 1, 3 , 5, 7 and/or 9. For example, such mutants 
could be derived from (other individuals of) the same species (for example from an 
5 individual of a different strain or line, including but not limited to mutant strains or lines); 
and/or from (individuals of) other species (in which case these mutants will also be 
referred to herein as "orthologs"). Some examples of species from which such orthologs 
could be derived include, but are not limited to species of 

unicellular and/or micro-organisms such as bacteria, and yeast, 
10 - invertebrate multicellular organisms as such as insects and nematodes (for example, 
agronomically harmful insect or nematode species); 

vertebrate multicellular organisms as such as fish, birds, reptiles, amphibians and 
mammals; 

Preferably, a natural ortholog is derived from a mammal such as a mouse, rat, 
15 rabbit or dog. 

Such natural mutants may be obtained in a manner essentially analogous to the 
methods described in the prior art referred to above, or alternatively by: 

construction of a DNA library from the species of interest in an appropriate 
expression vector system, followed by direct expression of the mutant sequence; 
20 - construction of a DNA library from the species of interest in an appropriate 

expression vector system, followed by screening of said library with a probe of the 
invention (as described below) and/or with a(nother) nucleotide sequence of the 
invention; 

isolation of mRNA that encodes the mutant sequence from the species of interest, 

25 followed by cDNA synthesis using reverse transcriptase; 

and/or by any other suitable method(s) or technique(s) known per se, for which reference 
is for instance made to the standard handbooks, such as Sambrook et al, "Molecular 
Cloning: A Laboratory Manual" ( 2nd.ed.), Vols. 1-3, Cold Spring Harbor Laboratory 
Press (1989) and F. Ausubel et al, eds., "Current protocols in molecular biology", Green 

30 Publishing and Wiley Interscience, New York (1987). 
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It is also expected that - based upon the disclosure herein - the skilled person will 
be able to provide and/or derive synthetic mutants (as defined hereinabove) of the 
nucleotide sequences of SEQ ID NOS: 1, 3, 5, 7 and/or 9. 

Techniques for generating such synthetic sequences will be clear to the skilled 
5 person and may for instance include, but are not limited to, automated DNA synthesis; 
site-directed mutagenesis; combining two or more parts of one or more naturally 
occurring sequences, introduction of mutations that lead to the expression of a truncated 
expression product; introduction of one or more restriction sites (e.g. to create casettes 
and/or regions that may easily be digested and/or ligated using suitable restriction 

] 0 enzymes), and/or the introduction of mutations by means of a PCR reaction using one or 
more "mismatched" primers, using for example a sequence of a naturally occurring 
GPCR as a template. These and other techniques will be clear to the skilled person, and 
reference is again made to the standard handbooks, such as Sambrook et al. and Ausubel 
et al., mentioned above. 

1 5 Preferably, any mutants as described herein will encode amino acid sequences 

having one or more, and preferably all, of the structural characteristics/conserved features 
referred to above for the sequences of SEQ ID NO: 2, 4, 6, 8 and/or 10, and in particular 
will contain a kinase domain. 

It is also possible in the invention to use a part or fragment of the nucleotide 

20 sequences of SEQ ID NOS 1, 3, 5, 7 or 9; or a part or fragment of a (natural or synthetic) 
mutant thereof. These may for instance be 5' and/or 3' truncated nucleotide sequences, or 
sequences with an introduced in frame startcodon or stopcodon. Also, two or more such 
parts or fragments of one or more nucleotide sequences of the invention may be suitably 
combined (e.g. ligated in frame) to provide a (further) nucleotide sequence of the 

25 invention. 

Preferably, any such parts or fragments will be such that they comprise at least 
one continuous stretch of at least 15 nucleotides, preferably at least 30 nucleotides, more 
preferably at least 60 nucleotides, even more preferably more than 90 nucleotides, of one 
or more of the nucleotide sequences of SEQ ID NOS: 1, 3, 5 , 7 and/or 9. 
30 In particular, any mutants, parts or fragments as described herein may be such that 

they (at least) encode the active/catalytic site of the corresponding amino acid sequence 
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of the invention and/or a binding domain of the corresponding amino acid sequence of 
the invention 

Any mutants, parts and/or fragments as described herein are preferably (also) such 
that they are capable of hybridizing with one or more of the nucleotide sequences of SEQ 
5 ID NOS 1, 3, 5, 7 and/or 9, i.e. under conditions of "moderate stringency", and preferably 
under conditions of "high stringency". Such conditions will be clear to the skilled person, 
for example from the standard handbooks, such as Sambrook et al. and Ausubel et al., 
mentioned above, as well as in EP 0 967 284, EP 1 085 089 or WO 00/553 1 8. 

In particular, any mutants, parts and/or fragments as described herein may be such 
10 that they are capable of hybridizing with the nucleotide sequence of SEQ ID NO 1 , 3, 5, 7 
and/or 9 under the "stringent" hybridisation conditions described in WO 00/78972 and 
WO 98/49185, and/or under the hybridization conditions described in GB 2 357 768-A. 

Also, any mutants, parts and/or fragments as described herein will preferably have 
a degree of "sequence identity", at the nucleotide level, with one or more of the 
15 nucleotide sequences of SEQ ID NOS: 1, 3, 5, 7 and/or 9 of at least 50%, preferably at 
least 60%, more preferably at least 70%, even more preferably at least 80%, and in 
particular more than 90%, and up to 95% or more. 

For this purpose, the percentage of "sequence identity" between a given 
nucleotide sequence and the nucleotide sequence of SEQ ID NO:l, 3, 5, 7 and/or 9 may 
20 be calculated by dividing [the number of nucleotides in the given nucleotide sequence 
that are identical to the nucleotide at the corresponding position in the nucleotide 
sequence of the relevant SEQ ID NO] by [the total number of nucleotides in the given 
nucleotide sequence] and multiplying by [100%], in which each deletion, insertion, 
substitution or addition of a nucleotide - compared to the sequence of the relevant SEQ 
25 ID NO - is considered as a difference at a single nucleotide (position). 

Alternatively, the degree of sequence identity may be calculated using a known 
computer algorithm for sequence alignment such as NCBI Blast v2.0, using standard 
settings. 

Some other techniques, computer algorithms and settings for determining the 
30 degree of sequence identity are for example described in EP 0 967 284, EP 1 085 089, 
WO 00/55318, WO 00/78972, WO 98/49185 and GB 2 357 768-A. 
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Also, in a preferred aspect, any mutants, parts and/or fragments as described 
herein will encode proteins/polypeptides having a biological activity that is essentially 
similar to the biological activity described above for the sequences of SEQ ID NOS: 2, 4, 
6, 8 and/or 10, i.e. to a degree of at least 10%, preferably at least 50 % more preferably at 
least 75%, and up to 90%, as measured by. a suitable assay for kinase activity known per 
se, such as those mentioned above (preferred) or below. In particular, they will have 
activity as a kinase, as measured using a suitable assay of kinase activity, such as those 
referred to above, and preferably an activity which is at least 10%, preferably at least 50 
% more preferably at least 75%, and up to 90% or more, of the kinase activity of any one 
of the sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10. 

Preferably, any mutants, parts and/or fragments of the nucleotide sequence of the 
invention will (also) be such that they encode an amino acid sequence which has a degree 
of "sequence identity", at the amino acid level, with one or more of the amino acid 
sequence of SEQ ID NOS: 2, 4, 6, 8 and/or 10 of at least 50%, preferably at least 60%, 
more preferably at least 70%, even more preferably at least 80%, and in particular more 
than 90% and up to 95 % or more, in which the percentage of "sequence identity" is 
calculated as described below. 

Preferably, a nucleotide sequence of the invention will (also) have a length 
(expressed as total number of nucleotides), which is at least 50%, preferably at least 60%, 
more preferably at least 70%, even more preferably at least 80%, and in particular more 
than 90% and up to 95 % or more of the length of one or more of the nucleotide sequence 
of SEQ ID NOS: 1, 3, 5, 7 and/or 9. 

Generally, the nucleotide sequences of the invention, when in the form of a 
nucleic acid, may be DNA or RNA, and may be single stranded or double stranded. For 
example, the nucleotide sequences of the invention may be genomic DNA, cDNA or 
synthetic DNA (such as DNA with a codon usage that has been specifically adapted for 
expression in the intended host cell or host organism). Thus, the nucleotide sequences of 
the invention may contain intron sequences, and also generally comprises different splice 
variants. 

It is also within the scope of the invention to use a fusion of a nucleotide sequence 
of the invention (as described above) with one or more further nucleotide sequence(s), 
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including but not limited to one or more coding sequences, non-coding sequences and/or 
regulatory sequences. Preferably, in such fusions, the one or more further nucleotide 
sequences are operably connected (as described below) to the nucleotide sequence of the 
invention (for example so that, when the further nucleotide sequence is a coding 
sequence, the nucleotide fusion encodes a protein fusion as described below). 

Another embodiment of the invention relates to a nucleic acid probe that is 
capable of hybridizing with a nucleotide sequence of the invention under conditions of 
moderate stringency, preferably under conditions of high stringency, and in particular 
under stringent conditions (all as described above). Such nucleotide probes may for 
instance be used for detecting and/or isolating a(nother) nucleotide sequence of the 
invention and/or as a primer for amplifying a nucleotide sequence of the invention; all 
using techniques known per se, for which reference is again made to the general 
handbooks such as Sambrook et al. and Ausubel et al. mentioned above. 

Generally, such probes can be designed by the skilled person starting from a 
nucleotide sequence and/or amino acid sequence of the invention - and in particular one 
or more of the sequences of SEQ ID NOS 1, 3, 5, 7 and/or 9 - optionally using a suitable 
computer algorithm. Also, as will be clear to the skilled person, such probes may be 
degenerate probes. 

In another embodiment, the invention relates to an antisense molecule against a 
nucleotide sequence of the invention. 

Yet another embodiment relates to a double stranded RNA molecule directed 
against a nucleotide sequence of the invention (one strand of which will usually comprise 
at least part of a nucleotide sequence of the invention). The invention also relates to 
genetic constructs that can be used to provide such double stranded RNA molecules (e.g. 
by suitable expression in a host cell or host organism, or for example in a bacterial strain 
such as E.coli). For such constructs, reference is made to for example the International 
Applications PCT/IB01/1068 and WO 00/01846, both by applicant. 

The amino acid sequence of SEQ ID NO: 2 was identified, and can be 
derived/isolated from/using the nematode C.elegans; in the manner as further described 
in Berman et al., or in any other suitable manner known per se. 
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The amino acid sequences of SEQ ID NOS: 4, 6, 8 and 10 were identified, and 
can be derived/isolated from/using human cells; in the manner as further described in the 
prior art referred to above, or in any other suitable manner known per se. 

Generally, the amino acid sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10 maybe 
5 isolated from the species mentioned above (i.e. C. elegans and human, respectively) , 
using any technique(s) for protein isolation and/or purification known per se. 
Alternatively, the amino acid sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10 may be 
obtained by suitable expression of a suitable nucleotide sequence - such as one of the 
nucleotide sequences of SEQ ID NOS: 1, 3, 5, 7 and/or 9, as applicable or a suitable 
10 mutant thereof - in an appropriate host cell or host organism, as further described below. 

Also, it is expected that - based upon the disclosure herein - the skilled person will 
be able to identify, derive and/or isolate natural "analogs" (as mentioned above) of the 
amino acid sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10. Such mutants could be 
derived from (other individuals of) the same species (for example from an individual of a 
15 different strain or line, including but not limited to mutant strains or lines); and/or from 
(individuals of) other species. For example, such analogs could be derived from the insect 
species or other pest species mentioned above. 

Such natural analogs may again be obtained by isolating them from their natural 
source using any technique(s) for protein isolation and/or purification known per se, or 
20 alternatively by suitable expression of a suitable nucleotide sequence of the invention - 
such as a natural mutant as described above - in an appropriate host cell or host organism, 
as further described below. 

It is also expected that - based upon the disclosure herein - the skilled person will 
be able to provide and/or derive synthetic "analogs" (as mentioned above) of one or 
25 more of the amino sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10. 

Generally, such synthetic analogs may be obtained by suitable expression of a 
suitable nucleotide sequence of the invention - such as a synthetic mutant as described 
above - in an appropriate host cell or host organism, as further described below. 

Preferably, any analogs as described herein will have one or more, and preferably 
30 all, of the structural characteristics/conserved features referred to above for the sequences 
of SEQ ID NO: 2, 4, 6, 8 and/or 10, and in particular will contain a kinase domain. 
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It is also possible in the invention to use a part or fragment of one or more of the 
amino acid sequences of SEQ ID NOS 2, 4, 6, 8 and/or 10, or a part or fragment of a 
(natural or synthetic) analog thereof mutant thereof. This may for instance be N- and/or 
C- truncated amino acid sequence. Also, two or more parts or fragments of one or more 
5 amino acid sequences of the invention may be suitably combined to provide a (further) 
amino acid sequence of the invention. 

Preferably, any such parts or fragments will be such that they comprise at least 
one continuous stretch of at least 5 amino acids, preferably at least 10 amino acids, more 
preferably at least 20 amino acids, even more preferably more than 30 amino acids, of 
10 one or more of the amino acid sequences of SEQ ID NO: 2, 4, 6, 8 and/or 10. 

In particular, any parts or fragments as described herein are such that they (at 
least) comprise the active/catalytic site of the corresponding amino acid sequence of the 
invention and/or a binding domain of the corresponding amino acid sequence of the 
invention, and in particular a kinase domain. As will be clear to the skilled person, such 
1 5 parts or fragments may find particular use in assay- and screening techniques (as 

generally described below) and/or (when said part or fragment is provided in crystalline 
form) in X-ray crystallography. 

Generally, such parts or fragments of the amino acid sequences of the invention 
may be obtained by suitable expression of a suitable nucleotide sequence of the invention 
20 - such as a suitable part or fragment as described hereinabove for the nucleotide 
sequences of the invention - in an appropriate host cell or host organism, as further 
described below. 

In addition and/or as an alternative to the methodology above, amino acid 
sequences of the invention may also be provided by (chemically and/or enzymatically) 

25 modifying the side chain(s) of one or more amino acid residues of an amino acid 
sequence of SEQ ID NO: 2, 4, 6, 8 and/or 10 or a part, fragment, (natural and/or 
synthetic) mutant, variant, allele, analogs, orthologs thereof, for example by one or more 
of the side chain modifications as described in WO 01/02560 and/or by incorporating 
(e.g. by insertion and/or substitution) one or more unnatural amino acid residues, again as 

30 described in WO 01/02560. 
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Preferably, any analogs, parts and/or fragments as described herein will be such 
that they have a degree of "sequence identity", at the amino acid level, with one or more 
of the amino acid sequences of SEQ ID NOS 2, 4, 6, 8 and/or 10 of at least 50%, 
preferably at least 60%, more preferably at least 70%, even more preferably at least 80%, 
5 and in particular more than 90% and up to 95 % or more. 

For this purpose, the percentage of "sequence identity" between a given amino 
acid sequence and one of the amino acid sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10 
may be calculated by dividing [the number of amino acid residues in the given amino 
acid sequence that are identical to the amino acid residue at the corresponding position 
10 in the amino acid sequence of the relevant SEQ ID NO] by [the total. number of amino 
acid residues in the given amino acid sequence] and multiplying by [100%], in which 
each deletion, insertion, substitution or addition of an amino acid residue - compared to 
the sequence of the relevant SEQ ID NO - is considered as a difference at a single amino 
acid (position). 

1 5 Alternatively, the degree of sequence identity may be calculated using a known 

computer algorithm, such as those mentioned above. 

Also, such sequence identity at the amino acid level may take into account so- 
called "conservative amino acid substitutions", which are well known in the art, for 
example from GB-A-2 357 768, WO 98/49185, WO 00/46383 and WO 01/09300; and 

20 (preferred) types and/or combinations of such substitutions may be selected on the basis 
of the pertinent teachings from the references mentioned in WO 98/49185. 

Also, preferably, any analogs, parts and/or fragments as described herein will 
have a biological activity that is essentially similar to the biological activity described 
above for the sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10, i.e. to a degree of at least 

25 10%, preferably at least 50 %, more preferably at least 75%, and up to 90%, as measured 
by the assay mentioned above. In particular, they will have activity as a kinase, as 
measured using a suitable assay of kinase activity, such as those referred to above, and 
preferably an activity which is at least 10%, preferably at least 50 %, more preferably at 
least 75%, and up to 90% or more, of the kinase activity of any one of the sequences of 

30 SEQ ID NOS: 2, 4, 6, 8 and/or 10. 
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Preferably, an amino acid sequence of the invention will (also) have a length 
(expressed as total number of amino acid residues), which is at least 50%, preferably at 
least 60%, more preferably at least 70%, even more preferably at least 80%, and in 
particular more than 90% and up to 95 % or more of the length of one or more of the 
amino acid sequence of SEQ ID NOS: 2, 4, 6, 8 and/or 10. 

It is also within the scope of the invention to use a fusion of an amino acid 
sequence of the invention (as described above) with one or more further amino acid 
sequences, for example to provide a protein fusion. Generally, such fusions may be 
obtained by suitable expression of a suitable nucleotide sequence of the invention - such 
as a suitable fusion of a nucleotide sequence of the invention with one or more further 
coding sequences - in an appropriate host cell or host organism, as further described 
below. According to one non-limiting embodiment, such a fusion comprises at least the 
kinase domain of one of the amino acid sequences of the invention. 

In one particular embodiment, such fusions may comprise an amino acid sequence 
of the invention (that preferably at least comprises the kinase domain, and may 
essentially consist of the kinase domain) fused with a reporter protein such as GFP, 
luciferase or another fluorescent protein moiety. As will be clear to the skilled person, 
such fusions may find particular use in expression analysis and similar methodologies. 

In another embodiment, the fusion partner may be an amino acid sequence or 
residue that may be used in purification of the expressed amino acid sequence, for 
example using affinity techniques directed against said sequence or residue. Thereafter, 
said sequence or residue may be removed (e.g. by chemical or enzymatical cleavage) to 
provide the nucleotide sequence of the invention (for this purpose, the sequence or 
residue may optionally be linked to the amino acid sequence of the invention via a 
cleavable linker sequence). Some preferred, but non-limiting examples of such residues 
are multiple histidine residues and glutatione residues, 

In one preferred, but non-limiting aspect, any such fusion will have a biological 
activity that is essentially similar to the biological activity described above for the 
sequences of SEQ ID NOS: 2, 4, 6, 8 and/or 10, i.e. to a degree of at least 10%, 
preferably at least 50 %, more preferably at least 75%, and up to 90%, as measured by the 
assay mentioned above. In particular, they will have activity as a kinase, as measured 
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using a suitable assay of kinase activity, such as those referred to above, and preferably 
an activity which is at least 10%, preferably at least 50 %, more preferably at least 75%, 
and up to 90% or more, of the kinase activity of any one of the sequences of SEQ ID 
NOS: 2, 4, 6, 8 and/or 10. 

Genetic constructs of the invention will generally comprise at least one nucleotide 
sequence of the invention, optionally linked to one or more elements of genetic constructs 
known per se, as described below. 

Such genetic constructs may be DNA or RNA, and are preferably double-stranded 
DNA. The constructs may also be in a form suitable for transformation of the intended 
host cell or host organism, in a form suitable for integration into the genomic DNA of the 
intended host cell or in a form suitable for independent replication, maintenance and/or 
inheritance in the intended host organism. For instance, the genetic construct may be in 
the form of a vector, such as for example a plasmid, cosmid, YAC, a viral vector or 
transposon. In particular, the vector may be an expression vector, i.e. a vector that can 
provide for expression in vitro and/or in vivo (e.g. in a suitable host cell and/or host 
organism as described below). 

As the one or more "further elements" referred to above, the genetic construct(s) 
of the invention may generally contain one or more suitable regulatory elements (such as 
a suitable promoters), enhancer(s), terminators), etc.), 3'- or 5'-UTR sequences, leader 
sequences, selection markers, expression markers/reporter genes, and/or elements that 
may facilitate or increase (the efficiency of) transformation or integration. These and 
other suitable elements for such genetic constructs will be clear to the skilled person, and 
may for instance depend upon the type of construct used, the intended host cell or host 
organism; the manner in which the nucleotide sequences of the invention of interest are to 
be expressed (e.g. via constitutive, transient or inducible expression); and/or the 
transformation technique to be used. 

Preferably, in the genetic constructs of the invention, the one or more further 
elements are "operably linked* to the nucleotide sequence(s) of the invention and/or to 
each other, by which is generally meant that they are in a functional relationship with 
each other. For instance, a promoter is considered "operably UnkedT to a coding sequence 
if said promoter is able to initiate or otherwise control/regulate the transcription and/or 
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the expression of a coding sequence (in which said coding sequence should be 
understood as being "under the control of said promotor) 

Generally, when two nucleotide sequences are operably linked, they will be in the 
same orientation and usually also in the same reading frame. They will usually also be 
essentially contiguous, although this may also not be required. 

Preferably, the optional further elements of the genetic construct(s) used in the 
invention are such that they are capable of providing their intended biological function in 
the intended host cell or host organism. 

For instance, a promoter, enhancer or terminator should be "operable" in the 
intended host cell or host organism, by which is meant that (for example) said promoter 
should be capable of initiating or otherwise controlling/regulating the transcription and/or 
the expression of a nucleotide sequence - e.g. a coding sequence - to which it is operably 
linked (as defined above). 

Such a promoter may be a constitutive promoter or an inducible promoter, and 
may also be such that it (only) provides for expression in a specific stage of development 
of the host cell or host organism, and/or such that it (only) provides for expression in a 
specific cell, tissue, organ or part of a multicellular host organism. 

Some particularly preferred promoters include, but are not limited to those present 
in the expression vectors referred to below. 

A selection marker should be such that it allows - i.e. under appropriate selection 
conditions - host cells and/or host organisms that have been (successfully) transformed 
with the nucleotide sequence of the invention to be distinguished from host 
cells/organisms that have not been (successfully) transformed. Some preferred, but non- 
limiting examples of such markers are genes that provide resistance against antibiotics 
(such as kanamycine or ampicilline), genes that provide for temperature resistance, or 
genes that allow the host cell or host organism to be maintained in the absence of certain 
factors, compounds and/or (food) components in the medium that are essential for 
survival of the non-transformed cells or organisms. 

A leader sequence should be such that - in the intended host cell or host organism 
- it allows for the desired post-translational modifications and/or such that it directs the 
transcribed mRNA to a desired part or organelle of a cell. A leader sequence may also 
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allow for secretion of the expression product from said cell. As such, the leader sequence 
may be any pro-, pre-, or prepro-sequence operable in the host cell or host organism. 

An expression marker or reporter gene should be such that - in the host cell or 
host organism - it allows for detection of the expression of (a gene or nucleotide sequence 
5 present on) the genetic construct. An expression marker may optionally also allow for the 
localisation of the expressed product, e.g. in a specific part or organelle of a cell and/or in 
(a) specific cell(s), tissue(s), organ(s) orpart(s) of a multicellular organism. Such reporter 
genes may also be expressed as a protein fusion with the amino acid sequence of the 
invention. Some preferred, but non-limiting examples include fluorescent proteins such 

10 as Green Fluorescent Protein (GFP). 

For some (further) non-limiting examples of the promoters, selection markers, 
leader sequences, expression markers and further elements that may be present/used in 
the genetic constructs of the invention - such as terminators, transcriptional and/or 
translational enhancers and/or integration factors - reference is made to the general 

15 handbooks such as Sambrook et al. and Ausubel et al. mentioned above, to W.B. Wood et 
al., "The nematode Caenorhabditis elegans", Cold Spring Harbor Laboratory Press 
(1988) and D.L. Riddle et al., "C. ELEGANS //", Cold Spring Harbor Laboratory Press 
(1997), as well as to the examples that are given in WO 95/07463, WO 96/23810, WO 
95/07463, WO 95/21 191, WO 97/1 1094, WO 97/42320, WO 98/06737, WO 98/21355, 

20 US-A-6,207,410, US-A- 5,693,492 and EP 1 085 089. Other examples will be clear to the 
skilled person. 

The genetic constructs of the invention may generally be provided by suitably 
linking the nucleotide sequence(s) of the invention to the one or more further elements 
described above, for example using the techniques described in the general handbooks 
25 such as Sambrook et al. and Ausubel et al., mentioned above. 

Often, the genetic constructs of the invention will be obtained by inserting a 
nucleotide sequence of the invention in a suitable (expression) vector known per se. 
Some preferred, but non-limiting examples of suitable expression vectors include: 
vectors for expression in mammalian cells: pMAMneo (Clontech), pcDNA3 
30 (Invitrogen), pMClneo (Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 

37593), pBPV-1 (8-2) (ATCC 371 10), pdBPV-MMTneo (342-12) (ATCC 37224), 
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pRSVgpt (ATCC37199), pRSVneo (ATCC37198), pSV2-dhfr (ATCC 37146), 
pUCTag (ATCC 37460) and 1ZD35 (ATCC 37565); 

vectors for expression in bacterials cells: pET vectors (Novagen) and pQE vectors 
(Qiagen); 

vectors for expression in yeast or other fungal cells: pYES2 (Invitrogen) and Pichia 
expression vectors (Invitrogen); 

vectors for expression in insect cells: pBlueBacII (Invitrogen). 

The nucleotide sequences and/or genetic constructs of the invention may be used 
to transform a host cell or host organism. 

The host cell may be any suitable (fungal, prokaryotic or eukaryotic) cell or cell 
line, for example: 

a bacterial strain, including but not limited to strains of E.coli, Bacillus., 
Streptomyces and Pseudomonas; 

a fungal cell, including but not limited to cells from species of Aspergillus and 
Trichoderma; 

a yeast cell, including but not limited to cells from species of Kluyveromyces or 
Saccharomyces; 

an amphibian cell or cell line, such as Xenopus oocytes. 

In one specific embodiment, which may prove particularly useful when the 
nucleotide sequences of the invention are (to be) used in the discovery and development 
of insecticidal compounds, the host cell may be an insect-derived cell or cell line, such 
as: 

cells/cell lines derived from lepidoptera, including but not limited to Spodoptera SF9 
and Sf21 cells, 

cells/cell lines derived from Drosophila, such as Schneider and Kc cells; and/or 
cells/cell lines derived from a pest species of interest (as mentioned below), such as 
from Heliothis virescens. 

In one preferred embodiment, the host cell is a mammalian cell or cell line, for 
example derived from the mammals referred to above. 
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In an even more preferred aspect, the host cell is a cell or cell line derived from a 
human, and from other mammals including but not limited to CHO- and BHK-cells and 
human cells or cell lines such as HeLa and COS. 

In one specific, but non-limiting embodiment, the cell or cell line may be a human 
5 cell or cell line which is related to metabolic processes or metabolic disease and/or used 
as a cellular model for metabolic disease, including but not limited to liver cells or cell 
lines, adipocytes or muscle cells or cell lines such as HEPG2 cells, 3T3L1 adipocytes, 
CTC12 cells and L6 myotubes. 

The host organism may be any suitable multicellular (vertebrate or invertebrate) 
10 organism, including but not limited to: 

a nematode, including but not limited to nematodes from the genus Caenorhabditis, 
such as C.elegans, 

an insect, including but not limited to species of Drosophila and/or a specific pest 
species of interest (such as those mentioned above); 
15 - other well known model organisms, such as zebrafish; 
a mammal such as a rat or mouse; 

Other suitable host cells or host organisms will be clear to the skilled person, for 
example from the handbooks and patent applications mentioned above. 

It should be noted that when a nucleotide sequence of the invention is expressed 
20 in a multicellular organism, it may be expressed throughout the entire organism, or only 
in one or more specific cells, tissues, organs and/or parts thereof, for example by 
expression under the control of a promoter that is specific for said cell(s), tissue(s), 
organ(s) or part(s). 

The nucleotide sequence may also be expressed during only a specific stage of 
25 development or life cycle of the host cell or host organism, again for example by 

expression under the control of a promoter that is specific for said stage of development 
or life cycle. Also, as already mentioned above, said expression may be constitutive, 
transient and/or inducible. 

According to one specific embodiment, the expression of a nucleotide sequence of 
30 the invention in a host cell or host organism may be party or totally reduced (i.e. knocked 
out), compared to the original (e.g. native) host cell or host organism. This may for 
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instance be achieved in a transient manner using antisense and/or RNA-interference 
techniques well known in the art, or in a constitutive manner using random, site specific 
and/or chemical mutagenesis of the nucleotide sequence of the invention, or any other 
suitable techniques for generating "knock-down" or "knock-out" animals. 
5 Suitable transformation techniques will be clear to the skilled person and may 

depend on the intended host cell/host organism and the genetic construct to be used. 
Some preferred, but non-limiting examples of suitable techniques include ballistic 
transformation, (micro-)injection, transfection (e.g. using suitable transposons), 
electroporation and lipofection. For these and other suitable techniques, reference is again 

10 made to the handbooks and patent applications mentioned above. 

After transformation, a step for detecting and selecting those host cells or host 
organisms that have been succesfully transformed with the nucleotide sequence/genetic 
construct of the invention may be performed. This may for instance be a selection step 
based on a selectable marker present in the genetic construct of the invention or a step 

15 involving the detection of the amino acid sequence of the invention, e.g. using specific 
antibodies. 

The transformed host cell (which may be in the form or a stable cell line) or host 
organisms (which may be in the form of a stable mutant line or strain) form further 
aspects of the present invention. 

20 Preferably, these host cells or host organisms are such that they express, or are (at 

least) capable of expressing (e.g. under suitable conditions), an amino acid sequence of 
the invention (and in case of a host organism: in at least one cell, part, tissue or organ 
thereof). The invention also includes further generations, progeny and/or offspring of the 
host cell or host organism of the invention, that may for instance be obtained by cell 

25 division or by sexual or asexual reproduction. 

To produce/obtain expression of the amino acid sequences of the invention, the 
transformed host cell or transformed host organism may generally be kept, maintained 
and/or cultured under conditions such that the (desired) amino acid sequence of the 
invention is expressed/produced. Suitable conditions will be clear to the skilled person 

30 and will usually depend upon the host cell/host organism used, as well as on the 

regulatory elements that control the expression of the (relevant) nucleotide sequence of 
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the invention. Again, reference is made to the handbooks and patent applications 
mentioned above in the paragraphs on the genetic constructs of the invention. 

Generally, suitable conditions may include the use of a suitable medium, the 
presence of a suitable source of food and/or suitable nutrients, the use of a suitable 
5 temperature, and optionally the presence of a suitable inducing factor or compound (e.g. 
when the nucleotide sequences of the invention are under the control of an inducible 
promoter); all of which may be selected by the skilled person. Again, under such 
conditions, the amino acid sequences of the invention may be expressed in a constitutive 
manner, in a transient manner, or only when suitably induced. 

10 It will also be clear to the skilled person that the amino acid sequence of the 

invention may (first) be generated in an immature form (as mentioned above), which may 
then be subjected to post-translational modification, depending on the host cell/host 
organism used. Also, the amino acid sequence of the invention may be glycosylated, 
again depending on the host cell/host organism used. 

15 The amino acid sequence of the invention may then be isolated from the host 

cell/host organism and/or from the medium in which said host cell or host organism was 
cultivated, using protein isolation and/or purification techniques known per se, such as 
(preparative) chromatography and/or electrophoresis techniques, differential precipitation 
techniques, affinity techniques (e.g. using a specific, cleavable amino acid sequence 

20 fused with the amino acid sequence of the invention) and/or preparative immunological 
techniques (i.e. using antibodies against the amino acid sequence to be isolated). 

In one embodiment, the amino acid sequence thus obtained may also be used to 
generate antibodies specifically against said sequence or an antigenic part or epitope 
thereof. 

25 Such antibodies, which form a further aspect of the invention, may be generated 

in a manner known per se, for example as described in GB-A-2 357 768, US-A- 
5,693,492, WO 95/32734, WO 96/23882, WO 98/02456, WO 98/41633 and/or WO 
98/49306, and/or as described in the prior art referred to above. Often, but not exclusively, 
such methods will involve immunizing an immunocompetent host with the pertinent 

30 amino acid sequence of the invention or an immungenic part thereof (such as a specific 
epitope), in amount(s) and according to a regimen such that antibodies against said amino 
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acid sequence are raised, and than harvesting the antibodies thus generated, e.g. from 
blood or serum derived from said host. 

For instance, polyclonal antibodies can be obtained by immunizing a suitable host 
such as a goat, rabbit, sheep, rat, pig or mouse with (an epitope of) an amino acid sequence 
of the invention, optionally with the use of an immunogenic carrier (such as bovine serum 
albumin or keyhole limpet hemocyanin) and/or an adjuvant such as Freund's, saponin, 
ISCOM's, aluminium hydroxide or a similar mineral gel, or keyhole limpet hemocyanin or a 
similar surface active substance. After a suitable immune response has been raised (usually 
within 1-7 days), the antibodies can be isolated from blood or serum taken from the 
immunized animal in a manner known per se, which optionally may involve a step of 
screening for an antibody with desired properties (i.e. specificity) using known 
immunoassay techniques, for which reference is again made to for instance WO 96/23882. 

Monoclonal antibodies may, for example, be produced using continuous cell lines 
in culture, including hybridoma-based and similar techniques, again essentially as described 
in the above cited references. Accordingly, cells and cell lines that produce monoclonal 
antibodies against an amino acid sequence of the invention form a further aspect of the 
invention, as do methods for producing antibodies against amino acid sequences of the 
invention, which methods may generally involve cultivating such a cell and isolating the 
antibodies from the culture (medium), again using techniques known per se. 

Also, Fab-fragments against the amino acid sequences of the invention (such as 
F(ab)2, Fab* and Fab fragments) may be obtained by digestion of an antibody with pepsin or 
another protease, reducing disul fide-linkages and treatment with papain and a reducing 
agent, respectively. Fab-expression libraries may, for instance, be obtained by the method of 
Huse et al., 1989, Science 245:1275-1281. 

In another embodiment, the nucleotide sequences of the invention, the amino acid 
sequences of the invention, and/or a host cell or host organism that expresses such an 
amino acid sequence, may also be used in an assay or assay method generally (including 
but not limited to diagnostic assays and/or assays to determing the presence and/or 
absence of specific mutations and/or genetic markers, for example to determine 
susceptibility for a condition or disease associated with such a mutation or marker), and 
in particular in an assay to identify and/or (further) develop compounds and/or other 
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factors that can modulate the (biological) activity of, and/or that can otherwise interact 
with, the amino acid sequences of the invention, and such uses form further aspects of the 
invention. As will be clear to the skilled person, in this context, the amino acid sequence 
of the invention will serve as a target for interaction with such a compound or factor 

In this context, the terms "modulate", "modulation, "modulator" and "target" will 
have their usual meaning in the art, for which reference is inter alia made to the 
definitions given in WO 98/06737. Generally, a modulator is a compound or factor that 
can enhance, inhibit/reduce or otherwise alter, influence or affect (collectively referred to 
as "modulation") a functional property of a biological activity or process (for example, 
the biological activity of an amino acid sequence of the invention). 

In this context, the amino acid sequence of the invention may serve as a target for 
modulation in vitro (e.g. as part of an assay or screen) and/or for modulation in vivo (e.g. 
for modulation by a compound or factor that is known to modulate the target, which 
compound or factor may for example be used as an active compound for agrochemical, 
veterinary and/or pharmaceutical use). 

For example, the amino acid sequences, host cells and/or host organisms of the 
invention may be used as part of an assay or screen that may be used to identify and/or 
develop modulators of the amino acid sequence of the invention, such as a primary screen 
(e.g. a screen used to identify modulators of the target from a set or library of test 
chemicals with unknown activity with respect to the target) and/or a secondary assay (e.g. 
an assay used for validating hits from a primary screen and/or used in optimizing hit 
molecules, e.g. as part of hits-to-leads chemistry). 

For instance, such an assay or screen may be configured as an in vitro assay or 
screen, which will generally involve binding of the compound or factor to be tested as a 
potential modulator for the target (hereinbelow also referred to as "test chemical") to the 
target, upon which a signal generated by said binding is measured. Suitable techniques 
for such in vitro screening will be clear to the skilled person, and are for example 
described in Eldefrawi et ah, (1987). FASEB J., Vol.1, pages 262-271 and Rauh et al., 
(1990), Trends in Pharmacol. Sci., vol.1 1, pages 325-329. For example, such an assay or 
screen may be configured as a binding assay or screen, in which the test chemical is used 
to displace a detectable ligand from the target (e.g. a radioactive or fluorescent ligand), 
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upon which the amount of ligand displaced from the target by the modulator is 
determined. 

Assays aimed at identification of small molecule inhibitors of protein kinases are 
well known. There is a large number of technologies that can be classified in two main 
5 approaches: 

1 . Detection based on an antibody specifically recognizing the phosphorylated product 
(peptide) of the kinase action 

These approaches rely on the specificity of antibody to differentiate between the 
peptide containing a phosporylated amino acid vs. the non-phosporylated peptide. 

10 This recognition can utilize, for example, the following readouts: 

fluorescence polarization (FP), where the detection is based on the change of the 
overall size upon antibody binding to fluorescent labeled peptide, 
fluorescence intensity (FI), where the antibody binding causes change in the 
intensity of the emission from a fluorescently labeled peptide. 

15 - time-resolve fluorescence (where both the antibody and the peptide are labeled 

with different fluorescent labels and the binding leads to energy transfer between 
these two labels. This in turn leads to shortening of the donor label fluorescence 
lifetime. These assays have at least two different embodiments: biochemical or 
cell-based, via enzyme complementation (the latter is marketed as HitHunter 

20 (TM), by Discoverex) . 

2. Detection based on other reagents specifically recognizing phosphate groups. 
Recently, at least three new techniques have been developed that do not rely on the 
use of specific antibody. 

Trivalent metal ion containing beads. These bind specifically to phosphate 
25 groups. The detection is than performed by change in fluorescence polarization, 

the same as above, upon antibody binding. (IMAP(TM), Molecular Devices)) 
based on chemical modification of the phosphate group and a specific label in the 
substrate to yield a change in fluorescence intensity. (IQ(TM), Pierce) 
based on chemical modification of the phosphate group to yield a biotin 
30 derivative. The detection is performed via FP upon avidin or strepavidin binding. 

(Caliper)" 
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Such an assay or screen may also be configured as a cell-based assay or screen, in 
which a host cell of the invention is contacted with/exposed to a test chemical, upon 
which at least one biological response by the host cell is measured. 

Suitable cells or cell lines for such cell based assays include those mentioned 
5 above. In one preferred, but non-limiting embodiment, the cell or cell line may be a 
mammalian, and in particular human, cell or cell line which is related to metabolic 
processes or metabolic disease and/or used as a cellular model for metabolic disease, 
including but not limited to liver cells or cell lines, adipocytes or muscle cells or cell lines 
such as HEPG2 cells, 3T3L1 adipocytes, CTC12 cells and L6 myotubes. 

10 Also, such an assay or screen may also be configured as a whole animal screen, 

in which a host organism of the invention is contacted with/exposed to a test chemical, 
upon which at least one biological response (such as a phenotypical, behavioural 
and/orphysiological change, including but not limited to paralysis or death) by the host 
organism is measured. Such screens may be carried out in any model organism known 

15 per se, including but not limited to yeast, Drosophila, zebrafish or C. elegans. 

Thus, generally, the assays and screens described above will comprise at least one 
step in which the test chemical is contacted with the target (and/or with a host cell or host 
organism that expresses the target), and in particular in such a way that a signal is 
generated that is representative for the modulation of the target by the test chemical 

20 (including modulation of the interaction of the target protein with any other proteins. 
Suitable techniques for screening such protein-protein interactions, such as FRET, SPA 
and BRET, will be clear to the skilled person). In a further step, said signal may then be 
detected. 

Accordingly, in one aspect, the invention relates to a method for generating a 
25 signal that is representative for the interaction of an amino acid sequence of the invention 
with a test chemical, said method at least comprising the steps of: 

a) contacting the amino acid sequence of the invention, or a host cell or host organism 
containing/expressing an amino acid sequence of the invention, with said test 
chemical, in such a way that a signal may be generated that is representative for the 

30 interaction between said test chemical and said amino acid sequence; and optionally 

b) detecting the signal that may thus be generated. 
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In another aspect, the invention relates to a method for identifying modulators of 
an amino acid sequence of the invention (e.g. from a set or library of test chemicals), said 
method at least comprising the steps of: 

a) contacting the amino acid sequence of the invention, or a host cell or host organism 
5 containing/expressing an amino acid sequence of the invention, with a test chemical, 

in such a way that a signal may be generated that is representative for the interaction 
between said test chemical and said amino acid sequence; and optionally 

b) detecting the signal that may thus be generated, said signal identifying a modulator of 
said amino acid sequence. 

10 Compounds that may be tested using the methods of the invention are generally 

described below. 

The assays and screens of the invention may be carried out at medium throughput 
to high throughput, for example in an automated fashion using suitable robotics. In 
particular, in this embodiment, the method of the invention may be carried out by 

15 contacting the target with the test compound in a well of a multi-well plate, such as a 
standard 24, 96, 384, 1536 or 3456 well plate. 

Usually, in a screen or assay of the invention, for each measurement, the target or 
host cell/host organism will be contacted with only a single test compound. However, it is 
also within the scope of the invention to contact the target with two or more test 

20 compounds - either simultaneously or sequentially - for example to determine whether 
said combination provides a synergistic effect. 

Once a test chemical has been identified as a modulator for an amino acid sequence of the 
invention (e.g. by means of a screen or assay as described hereinabove), it may be used 
per se as a modulator of the amino relevant amino acid sequence of the invention (e.g. as 

25 an active substance for pharmaceutical use), or it may optionally be further optimized for 
final use, e.g. to improve properties such as solubility, ADME-TOX (see for example 
http://www.scherago.com/admet/ or http://www.lifesciencesinfo.com/admet) and other 
desired properties. It will be clear to the skilled person that the nucleotide sequences, 
amino acid sequences, host cells/host organisms and/or methods of the invention may 

30 find further use in such optimization methodology, for example as (part of) secondary 
assays. 



WO 2004/056982 PCT/EP2003/014674 

32 

The invention is not particularly limited to any specific manner or mechanism 
in/via which the modulator (e.g. the test chemical, compound and/or factor) modulates, or 
interacts with, the target (in vivo and/or in vitro). For example, the modulator may 
comprise an agonist, an antagonist, an inverse agonist, a partial agonist, a competitive 
5 inhibitor, a non-competitive inhibitor, a cofactor, an allosteric inhibitor or other allosteric 
factor for the target, and/or may be a compound or factor that enhances or reduces 
binding of target to another biological component associated with its (biological) activity, 
such as another protein or polypeptide, a receptor, or a part or organelle of a cell. As 
such, the modulator may bind with the target (at the active site, at an allosteric site, at a 
10 binding domain and/or at another site on the target, e.g. covalently or via hydrogen 

bonding), block the active site of the target (in a reversible, irreversible or competitive 
manner), block a binding domain of the target (in a reversible, irreversible or competitive 
manner), and/or influence or change the conformation of the target. 
As such, the test chemical/modulator may for instance be: 
15 - an analog of a known substrate of the target; 

an oligopeptide, e.g. comprising between 2 and 20, preferably between 3 and 15 
amino acid residues; 

an antisense or double stranded RNA molecule; 
a protein, polypeptide; 
20 - a cofactor or an analog of a cofactor. 

Preferably, the compound is an inhibitor of the target, although the invention in its 
broadest sense is not limited thereto. 

The test chemical/modulator may also be a reference compound or factor, which 
may be a compound that is known to modulate or otherwise interact with the target (e.g. a 
25 known substrate or inhibitor for the target) or a compound or factor that is generally 
known to modulate or otherwise interact with other members from the general class to 
which the target belongs (e.g. a known substrate or inhibitor of said class). 

Preferably, however, the compound(s) will be "small molecules", by which is 
generally meant herein a molecular entity with a molecular weight of less than 1500, 
30 preferably less than 1000. This may for example be an organic, inorganic or 

organometallic molecule, which may also be in the form or a suitable salt, such as a 
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water-soluble salt; and may also be a complex, chelate and/or a similar molecular entity, 
as long as its (overall) molecular weight is within the range indicated above. 

In a preferred embodiment, such a "small molecule" has been designed according, 
and/or meets the criteria of, at least one, preferably at least any two, more preferably at 
5 least any three, and up to all of the so-called Lipinski rules for drug likeness prediction 
(vide Lipinksi et al., Advanced Drug Delivery Reviews 23 (1997), pages 3-25). As is 
known in the art, small molecules which meet these criteria are particularly suited (as 
starting points) for the design and/or development of pharmaceuticals for human use, and 
may for instance be used as starting points for hits-to-leads chemistry, and/or as starting 

10 points for lead development (in which the methods of the invention may also be applied). 

Also, for these purposes, the design of such small molecules (as well as the design 
of libraries consisting of such small molecules) will preferably also take into account the 
presence of pharmacophore points, for example according to the methods described by I. 
Muegge et al., J. Med. Chem. 44, 12 (2001), pages 1-6 and the documents cited herein. 

15 The term "small peptide " generally covers (oligo)peptides that contain a total of 

between 2 and 35, such as for example between 3 and 25, amino acids (e.g. in one or 
more connected chains, and preferably a single chain). It will be clear that some of these 
small peptides will also be included in the term small molecule as used herein, depending 
on their molecular weight. 

20 In one preferred, but non-limiting embodiment, the invention is used to screen a 

set or library of (related or otherwise unrelated) small molecules, for example a standard 
"robustness set", a primary screening library (e.g. of otherwise unrelated compounds), a 
combinatorial library, a series of closely related chemical analogos. Such sets or libraries 
will be clear to the skilled person, and may for instance include, but are not limited to, 

25 such commercially available chemical libraries such as the various libraries available 
from Tocris Cookson, Bristol, UK. 

In yet a further aspect, the invention relates to a method for identifying a 
compound that can be used in (the preparation of a pharmaceutical composition for) the 
prevention and/or treatment of metabolic diseases (e.g. from a set or library of test 

30 chemicals), said method at least comprising the steps of: 
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a) contacting an amino acid sequence of the invention, and/or a host cell or host 
organism containing/expressing an amino acid sequence of the invention, with a test 
chemical, in such a way that a signal may be generated that is representative for the 
interaction between said test chemical and said amino acid sequence of the invention; 

5 and optionally 

b) detecting the signal that may thus be generated, said signal identifying a modulator of 
said amino acid sequence. 

The modulators thus identified can be used in (the preparation of a pharmaceutical 
composition for) the prevention and/or treatment of metabolic diseases, and/or can be 
10 used to develop other compounds that can be used in (the preparation of a pharmaceutical 
composition for) the prevention and/or treatment of metabolic diseases, i.e. as already 
outlined above. 

The invention also relates to the use of an amino acid sequence and/or a host 
cell/host organism of the invention in such a method. 

15 Also, as already mentioned above, the use of the human nucleotide sequences of 

SEQ ID NOS: 3, 5, 7 and/or 9 and/or of sequences derived therefrom (such as mutants, 
parts, fragments and/or fusions thereof as described hereinabove), of the human amino 
acid sequences of SEQ ID NOS: 4, 6, 8 and/or 10 and/or of sequences derived therefrom 
(such as analogs, parts, fragments, and/or fusions thereof as described hereinabove), and 

20 of host cells/host organisms containing/expressing these, are usually preferred, in 
particular when the invention is used to develop compounds for pharmaceutical use. 

As already mentioned above, the compounds and/or factors that have been 
identified and/or developed as modulators of the amino acid sequences of the invention 
(and/or precursors for such compounds) may be useful as active substances in the 

25 pharmaceutical field, for example in the preparation of pharmaceutical compositions, and 
both such modulators as well as (pharmaceutical) compositions containing them comprise 
further aspects of the invention. 

In particular, the compounds and composition of the invention may be used in 
(the preparation of pharmaceutical compositions for) the prevention (e.g. prophylaxis) 

30 and/or treatment of metabolic diseases (which for the purposes herein in its broadest 
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sense also includes preventing, treating and/or alleviating the symptoms and/or 
complications of such metabolic diseases). 

In particular, such compounds and composition may be used in (the preparation of 
pharmaceutical compositions for) the prevention (e.g. prophylaxis) and/or treatment of 
5 metabolic diseases (which for the purposes herein in its broadest sense also includes 
preventing, treating and/or alleviating the symptoms and/or complications of such 
metabolic diseases). 

In particular, the compounds and compositions of the invention may be used for 
preventing and/or treating: 
10 - hyperglycemic conditions and/or other conditions and/or diseases that are (primarily) 
associated with (the response or sensitivity to) insulin, including but not limited to all 
forms of diabetes and disorders resulting from insulin resistance, such as Type I and 
Type II diabetes, as well as severe insulin resistance, hyperinsulinemia, and 
hyperlipidemia, e.g., obese subjects, and insulin-resistant diabetes, such as 
15 Mendenhall's Syndrome, Werner Syndrome, leprechaunism, lipoatrophic diabetes, 

and other lipoatrophies; 

conditions caused or usually associated with hyperglycemic conditions and/or 
obesity, such as hypertension, osteoporosis and/or lipodystrophy, 
so-called "metabolic syndrome" (also known as "Syndrome X") which is a condition 
20 where several of the following conditions coexist: hypertension; insulin resistance; 

diabetes; dyslipidemia; and/or obesity. 

In particular, the compounds and compositions of the invention may be used for 
preventing and/or treating diabetes, especially Type I and Type II diabetes. "Diabetes" 
itself refers to a progressive disease of carbohydrate metabolism involving inadequate 
25 production or utilization of insulin and is characterized by hyperglycemia and glycosuria. 
Also, as mentioned above, the amino acid sequences of the invention and in 
particular the nucleotide sequences of the invention, and more in particular the human 
amino acid sequences and nucleotide sequences of the invention may be used for 
diagnostic purposes, for example as part of diagnostic assays and/or as part of kits for 
30 performing such assays (in which such a kit will comprise at least a nucleotide sequence 
of the invention) may be suitably packaged (e.g. in a suitbale container) and may 
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optionally further comprise one or more elements for such kits known per se, such as 
suitable reagents, buffers or other solvents, and instructions for use). 

In particular, the amino acid sequences and nucleotide sequences of the invention, 
as well as assays and kits using such sequences, may be used for diagnostic purposes 
5 relating to one or more of the metabolic diseases specified, for example as assays to 
determine the presence and/or absence in an individual of specific mutations and/or 
genetic markers that relate to one or more of the metabolic diseases specified above, to 
determine the susceptibility and/or any predisposition to any of said metabolic diseases in 
an individual, to determine if any genetically determined factors contribute or even cause 

10 (in full or in part) said metabolic disease in an individual, determine and/or to confirm the 
kind of metabolic disease from which an individual suffers, and/or to predict the further 
progress of such a metabolic disease in an individual It will also be clear that any results 
obtained using such a diagnostic method or assay may also provide guidance to the 
clinician as to how such a metabolic disease should be treated in an individual, e.g. which 

15 medication should be prescribed and/or the doage regimen to be used, 

It should also be noted that, for the treatment of metabolic disease in humans, the 
compound used will usually and preferably be an inhibitor of an amino acid sequence of 
the invention, although the invention is its broadest sense is not limited thereto. 

In one specific, but non-limiting, embodiment of the invention, a compound is 

20 considered an inhibitor of one of the amino acid sequences of the invention if, in a 
relevant assay such as the kinase activity assays referred to above (or a suitable 
modification thereof, for example using partially or fully purified protein), said 
compound reduces the activity of said amino acid sequence, i.e. by at least 1%, preferably 
at least 10%, such as by 20% or more, compared to the activity without the presence of 

25 said compound. 

In an even more specific, but non-limiting, embodiment of the invention, a 
compound is considered an inhibitor of one of the amino acid sequences of the invention 
if, in a relevant assay, such as a binding assay, said compound has an IC50 value of less 
than 1000 jzM, preferably at than 500 jxM, more preferably less than 250j*M, even more 

30 preferably less than 100 jzM, for example 50 fxM or less, such as about 10 /iM or less. 
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Again, preferably, in the invention compounds are used that are modulators, and 
in particular inhibitors, of the human amino acid sequences of SEQ ID NO: 4, 6, 8 and/or 
10, and/or of amino acid sequences derived therefrom, such as analogs, mutants, parts, 
fragments and/or fusions as described above. 
5 For pharmaceutical use, the compounds of the invention may be used as a free 

acid or base, and/or in the form of a pharmaceutically acceptable acid-addition and/or 
base-addition salt (e.g. obtained with non-toxic organic or inorganic acid or base), in the 
form of a hydrate, solvate and/or complex, and/or in the form or a pre-drug, such as an 
ester. Such salts, hydrates, solvates, etc. and the preparation thereof will be clear to the 

10 skilled person; reference is for instance made to the salts, hydrates, solvates, etc. 
described in US-A-6,372,778, US-A-6,369,086 and US-6,369,067 

Generally, for pharmaceutical use, the compounds of the inventions may be 
formulated as a pharmaceutical preparation comprising at least one compound of the 
invention and at least one pharmaceutically acceptable carrier, diluent or excipient and/or 

15 adjuvant, and optionally one or more further pharmaceutically active compounds. By 
means of non-limiting examples, such a formulation may be in a form suitable for oral 
administration, for parenteral administration (such as by intravenous, intramuscular or 
subcutaneous injection or intravenous infusion), for topical administration, for 
administration by inhalation, by a skin patch, by an implant, by a suppository, etc. Such 

20 suitable administration forms - which may be solid, semi-solid or liquid, depending on 
the manner of administration - as well as methods and carriers for use in the preparation 
thereof, will be clear to the skilled person; reference is again made to for instance US-A- 
6,372,778, US-A-3,696, 086 and US-6,369,067. 

The pharmaceutical preparations of the invention are preferably in a unit dosage 

25 form, and may be suitably packaged, for example in a box, blister, vial, bottle, sachet, 
ampoule or in any other suitable holder or container (which may be properly labeled); 
optionally with one or more leaflets containing product information and/or instructions 
for use. Generally, such unit dosages will contain between 1 and 500 mg of the at least 
one compound of the invention, e.g. about 10, 25, 50, 100, 200, 500 or 1000 mg per unit 

30 dosage. 
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For pharmaceutical use, at least one compound of the invention will generally be 
administered in an amount of between 0.01 to 1 50 mg/kg body weight per day of the 
patient, divided over one or more daily doses. The amount(s) to be administered and the 
further treatment regimen may be determined by the treating clinician, depending on 
5 factors such as the age, gender and general condition of the patient and the nature and 
severity of the disease/symptoms to be treated. 

Thus, in a further aspect, the invention relates to a composition, and in particular a 
composition for pharmaceutical use, that contains at least one compound of the invention 
(i.e. a compound that has been identified, discovered and/or developed using a nematode 

10 or method as described herein) and at least one suitable carrier (i.e. a carrier suitable for 
pharmaceutical use). The invention also relates to the use of a compound of the invention 
in the preparation of such a composition. 

Preferably, the compounds and compositions of the invention are administered 
orally and/or in a form intended and/or suitable for oral administration. 

15 It is also envisaged that the above compounds and compositions may be of value 

in the veterinary field, which for the purposes herein not only includes the prevention 
and/or treatment of diseases in animals, but also - for economically important animals 
such as cattle, pigs, sheep, chicken, fish, etc. - enhancing the growth and/or weight of the 
animal and/or the amount and/or the quality of the meat or other products obtained from 

20 the animal. Thus, in a further aspect, the invention relates to a composition for veterinary 
use that contains at least one compound of the invention (i.e. a compound that has been 
identified, discovered and/or developed using a nematode or method as described herein) 
and at least one suitable carrier (i.e. a carrier suitable for veterinary use). The invention 
also relates to the use of a compound of the invention in the preparation of such a 

25 composition. 

In the agrochemical field, the invention may be used to identify compounds 
suitable for use in pesticides, insecticides, nematicides and/or other biocides or plant 
protection agents. For example, the compounds of the invention may be used to control 
the species listed in US-A-6,372,774. For this purpose, the compounds of the invention 
30 (or a suitable salt, hydrate or ester thereof) may be suitably formulated with one or more 
agrochemically acceptable carriers, to provide a formulation suitable for agrochemical 
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use, as will be clear to the skilled person (reference is, for example, made to the 
formulations and uses described in US-A-6,372,774). 

Thus, in a further aspect, the invention relates to a composition for agrochemical 
use that contains at least one compound of the invention (i.e. a compound that has been 
identified, discovered and/or developed using a nematode or method as described herein) 
and at least one suitable carrier (i.e. a carrier suitable for agrochemical use). The 
invention also relates to the use of a compound of the invention in the preparation of such 
a composition. 

The invention will now be further illustrated by means of the following non- 
limiting Experimental Part. 
In the Figures: 

Figure 1 schematically shows vector pGN49A (see also also WO 00/01846 and 
British patent application 0012233, both by Applicant); 

Figures 2A and 2B are photographs (enhanced using the Scion Image (Scion Corp) 
software package) showing reduced fat-absorption phenotype in C. elegans upon 
Nile Red Staining: Figure 2A = reduced fat storage (T17E9.1a- invention); Figure 2B 
= reference (vector gGN29 without T17E9.1a). 

Experimental part: 

In the Experimental Part below, unless indicated otherwise, all steps for handling 
and cultivating C. elegans were performed using standard techniques and procedures, for 
which reference is made to the standard G elegans handbooks, such as W.B. Wood et al., 
"The nematode Caenorhabditis elegans", Cold Spring Harbor Laboratory Press (1988); 
D.L. Riddle et al., "C ELEGANS ZT\ Cold Spring Harbor Laboratory Press (1997); 
"Caenorhabditis elegans, Modern Biological analysis of an organism": ed. by H. Epstein 
and D. Shakes, Methods in Cell Biology, Vol 48, 1995; and "C. elegans, a practical 
approach" , ed. by LA. Hope, Oxford University Press Inc. New York, USA, 1999. 

Downregulation of the gene(s) of interest in C elegans was achieved by RNAi 
feeding techniques using an E.coli strain capable of expressing a dsRNA corresponding 
to the gene(s) of interest, as generally described in - inter alia - the International 
application WO 00/01 846 by applicant and the handbooks referred to above. 
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Also, unless indicated otherwise, all cloning and other molecular biology steps 
were performed using standard techniques and protocols, i.e. as provided by the 
manufacturers of the reagents/kits used and/or as described in the standard handbooks, 
such as Sambrook et al, "Molecular Cloning: A Laboratory Manual" ( 2nd.ed.), Vols. 1-3, 
5 Cold Spring Harbor Laboratory Press (1989) and F. Ausubel et al, eds., "Current 

protocols in molecular biology", Green Publishing and Wiley Interscience, New York 
(1987). 

Fat accumulation in C. elegans daf-2 (el 370) was determined visually under a 
microscope upon staining with Nile-red, using an adaptation of the general methodology 

10 described by Ogg et al., Nature, Vol. 389, 994 (1 997). For the general methodology, 
reference is also made to Thaden et al., 1999 International Worm Meeting abstract 837; 
Ashrafi and Ruvkun, 2000 East Coast Worm Meeting abstract 67; Ashrafi, Chang and 
Ruvkun, 2001 International Worm Meeting abstract 325; and Rottiers and Antebi, 2001 
International Worm Meeting abstract 620 (all abstracts available from Worm Literature 

1 5 Index at http://elegans.swmed.edu/wli/). 

Example 1: Preparation E.coli RNA feeding strain for expression of T17E9.1a double 
stranded RNA. 

A vector for expression of dsRNA for downregulation of C. elegans gene 
20 T17E9. la (kin-18) was prepared as follows. 

The DNA fragment of SEQ ID NO: 11, which corresponds to 1576 nucleotides 
(26 % compared to the genomic sequence) of the C. elegans T17E9.1a gene, was 
obtained by PCR from genomic C. elegans DNA, using the following primers: 

25 - forward primer : CGAAAACCAGCAGAAGAGCG [SEQ ID NO: 1 2] 

- reverse primer : TCGAGGCATGTTCAGACTTCG [SEQ ID NO: 1 3] 



30 



This fragment was inserted in the Sr/7-site of expression vector pGN49a pGN49A 
(Figure 1, see also WO 00/01846 and British patent application 0012233, both by 
Applicant). This vector contains two T7 promoters flanking the Sr/I-s'\te, allowing 
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transcription of a nucleotide sequence inserted into said S//T-site into double stranded 
RNA, upon binding of a T7 polymerase to said promoter (vide WO 00/01846). 

The resulting vector, designated pGN49A-T17E9.1a, was transformed overnight 
into E. coli strain AB 309-105 (see EP-A-1 093 526 by applicant, page 17.). 
5 To normalize the culture, 250 ^1 of the overnight culture (1 ml) was transferred to 

a 96 well plate and the OD at 600 nm was measured (Fluostar Galaxy plate reader BMG), 
the remaining 750 ^1 centrifuged down. Next the pellet was re-suspended in S-complete 
fed ( S-complete supplemented with O.lmg/ml ampiciline and 1 mM IPTG) and volume 
adjusted to obtain ODeoo value of 1 

10 

Example 2: Generation of fat storage phenotype in C. elegans. 

In this example, C elegans strain CB1370 containing the temperature sensitive 
daf-2 allele e-1370 is used (Ogg et al., supra). CB 1370 is publicly available from, for 
example, the Caenorhabditis Genetics Center (CGC), Minnesota, USA). 

15 To generate the fat-storage phenotype, LI worms of strain CB 1370 were 

cultivated at a temperature of 15 °C in S-Complete fed-medium in the wells of a 96 well 
plate (1 LI nematodes per well) under essentially synchronized conditions, until the 
nematodes reached the L2 stage. 

Then, the temperature was increased to 20°C, and the worms were further 

20 cultivated at said temperature until their Fl offspring reached the L4 stage (about 144 

hrs). Due to the presence of the daf-2 allele e-1370, this raise in temperature from 15°C to 
20°C causes the nematodes to accumulate fat, mainly in their intestinal and hypodermal 
tissue (vide Ogg et al. and Figures 2A and 2B). 

The accumulation of fat (in the form of droplets) was made visible by means of 

25 Nile Red staining: L4 animals were washed several times with M9 (supplemented with 
0. 1% PEG) to remove the remaining E.coli, and fixed with MeOH (fc. 33%). After 
fixation the nematodes were stained with nile red (fc 0.375 mM in 37.5% MeOH) for 4 
hours. MeOH and excess dye was removed through several washes with M9 
(supplemented with 0.1% PEG). The staining pattern was visualized under UV using a 

30 5 00 nm long pass filter. 
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For testing the influence of the gene T17E9.1a on fat storage, during the steps 
described above, the worms were grown on 30 jil of the normalized E.coli pGN49A- 
T17E9.1a strain of Example 1 (OD 6 oo = 1) as a food source. As a reference, the daf-2 
(el370) nematodes were grown in a similar manner, but with E. coli strain AB 309-105 
5 containing vector pGN29A without the T17E9.1a insert as a food source, used in the 
same amount. All samples were carried out in quadruplicate. 

The results were as follows: worms fed on E.coli pGN49A-T17E9.1a strain, 
which downregulates the expression of T17E9.1a through RNA interference, showed a 
strong reduction of the accumulation of fat, compared to the reference (vide Figures 2A 
10 and 2B). 

These results show that T17E9.1a is involved in the regulation of the (daf-2 
dependent) ^/-pathway and the accumulation of fat in the nematode. It is known in the 
art that both are models for insulin resistance and fat handling in mammals, such as 
humans. 

15 Bio-informatic analysis of the sequence of T17E9.1a (kin-18) showed that the 

amino acid sequence of the kin-18 protein (SEQ ID NO:2) has the following degree of 
sequence identity (in percentages at the amino acid level) with the human amino acid 
sequences of SEQ ID NOS: 4, 6, 8 and 10, respectively: 

20 - SEQ ID NO:4 ("JIK"): overall sequence: 36%; kinase domain: 61% 

SEQ ID NO:6 ("PSK"): overall sequence: 24%; kinase domain: 59% 

SEQ ID NO:8 (?TA01")\ overall sequence: 32%; kinase domain: 59% 

SEQ ID NO: 1 0 ("Q9P2I6"): overall sequence: 33%; kinase domain: 6 1 % 



25 
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Example 3: Screening of SEP ID NO. 4 ("JHCY 

A protein with the amino acid sequence of SEQ ID NO: 4, isolated and purified in 
a manner known per se, were used to screen a reference set of 96 commercially available 
kinase inhibitors (obtained from several commercial sources such as Acros Organics, 
5 Belgium/USA; Alexis Biochemicals, UK; Biomol, USA; Calbiochem, USA; ICN 

Biomedicals; Sigma; and Tocris, UK), using the commercially available Kinase Glo® 
Luminescent Kinase Assay (Promega, USA), essentially according to the instructions of 
the manufacturer. 

In this assay, 6 out of 96 compounds showed a percentage which differed more 
10 than 3 Standard Deviation (SD) from 0 (zero) inhibition. These 6 compounds were 
designated as hits, corresponding with a hit-rate of about 6 %.. 

Example 4: Screening of SEP ID NO. 8 ("TAPD. 

A protein with the amino acid sequence of SEQ ID NP: 8, isolated and purified in 

15 a manner known per se, were used to screen a reference set of 96 commercially available 
kinase inhibitors (obtained from several commercial sources such as Acros Organics, 
Belgium/USA; Alexis Biochemicals, UK; Biomol, USA; Calbiochem, USA; ICN 
Biomedicals; Sigma; and Tocris, UK), using the commercially available Kinase Glo® 
Luminescent Kinase Assay (Promega, USA), essentially according to the instructions of 

20 the manufacturer. 

In this assay, 6 out of 96 compounds showed a percentage which differed more 
than 3 Standard Deviation (SD) from 0 (zero) inhibition. These 6 compounds were 
designated as hits, corresponding with a hit-rate of about 6 %. 
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CLAIMS: 

1. Nucleic acid, in essentially isolated form, said nucleic acid encoding one of 
the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 
8 and/or SEQ ID NO: 10, or an analog, variant, allele, ortholog, part and/or fragment of 
one of the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID NO: 8 and/or SEQ ID NO: 10. 

2. Nucleic acid, in essentially isolated form, said nucleic acid comprising one of 
the nucleotide sequences of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 

7 and/or SEQ ID NO: 9, or a mutant, variant, allele, analog, ortholog, part and/or 
fragment thereof. 

3. Genetic construct, comprising the nucleic acid of claim 1 and/or of claim 2, 
and optionally one or more further elements of genetic constructs known per se. 

4. Host cell or host organism, transformed with and/or containing a nucleic acid 
according to claim 1 and/or claim 2 and/or a genetic construct according to claim 3. 

5. Host cell or host organism, (capable of) expressing and/or producing one of 
the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 

8 and/or SEQ ID NO: 10 or an analog, variant, allele, analog, ortholog, part and/or 
fragment thereof. 

6. Method for producing one of the amino acid sequences of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 and/or SEQ ID NO: 10, or an analog, 
variant, allele, ortholog, part and/or fragment thereof, said method comprising at least the 
steps of: 

a) expressing of a nucleic acid according to claim 1 and/or 2 in a suitable host cell or 
host organism; and optionally 
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b) isolating the amino acid sequence thus expressed. 

7. Method for producing one of the amino acid sequences of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 and/or SEQ ID NO: 10, or an analog, 

5 variant, allele, ortholog, part and/or fragment thereof, said method comprising at least the 
steps of: 

a) maintaining and/or cultivating a host cell or host organism according to claim 4 or 5 
under conditions such that said host cell or host organism expresses or produces one 
of the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 

10 ID NO: 8 and/or SEQ ID NO: 10, or an analog, variant, allele, ortholog, part and/or 

fragment thereof; and optionally 

b) isolating the amino acid sequence thus expressed/produced. 

8. Protein or polypeptide, in essentially isolated form, said protein or polypeptide 
15 comprising one of the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 

NO: 6, SEQ ID NO: 8 and/or SEQ ID NO: 10, or an analog, variant, allele, ortholog, part 
and/or fragment thereof. 

9. Method for generating a signal that is representative for the interaction of a 
20 protein or polypeptide of claim 8 with a test chemical, said method at least comprising 

the steps of: 

a) contacting a protein or polypeptide of claim 8, or a host cell or host organism of claim 
4 or 5, with said test chemical, in such a way that a signal may be generated that is 
representative for the interaction between said test chemical and said protein or 

25 polypeptide; and optionally 

b) detecting the signal that may thus be generated. 

10. Method for identifying a modulator of a protein or polypeptide of claim 8, for 
example from a set or library of test chemicals, said method at least comprising the steps 

30 of: 
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a) contacting a protein or polypeptide of claim 8, or a host cell or host organism of claim 
4 or 5, with said test chemical, in such a way that a signal may be generated that is 
representative for the interaction between said test chemical and said protein or 
polypeptide; and optionally 

b) detecting the signal that may thus be generated, said signal identifying a modulator of 
said amino acid sequence. 

1 1 . Modulator of a protein or polypeptide according to claim 8, identified and/or 
developed using a nucleic acid according to claim 1 and/or 2, a host cell according to 
claim 4 and/or 5, a protein or polypeptide according to claim 8, and/or a method 
according to claim 9 and/or 10. 

12. Modulator according to claim 1 1, being an inhibitor of a protein or 
polypeptide of claim 8. 

13. Pharmaceutical composition, comprising at least one modulator according to 
claim 1 1 and/or 12 and at least one pharmaceutical^ acceptable carrier. 

14. Pharmaceutical composition according to claim 13, being a composition 
suitable and/or intended for oral administration. 

15. Use of a modulator according to claim 1 1 and/or 12 in the preparation of a 
pharmaceutical composition. 

16. Use of a modulator according to claim 1 1 and/or 12 in the preparation of a 
pharmaceutical composition for the prevention and/or treatment of metabolic diseases. 

17. Antibody against a protein or polypeptide according to claim 8. 
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Figure 2A 




Figure 2B 
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5 



35 



40 



45 



50 



55 



SEQUENCE LISTING 

<110> Devgen N.V. 
Devgen N.V. 



<120> Kinase sequences useful for developing compounds for the 
10 prevention and/or treatment of metabolic diseases and nucleotide 
sequences encoding such kinase sequences. 

<130> P 02-011 

15 <160> 13 

<170> Patentln version 3.1 

<210> 1 

20 

<211> 2949 

<212> DNA 

25 <213> Caenorhabditis elegans 

<400> 1 



30 



atggcgcctg 


ccgtcttaca 


aaaacccggt 


gttatcaagg 


atccatcgat 


tgctgcattg 


60 


ttcagtaata 


aggatccaga 


gcagagatat 


caagatttaa 


gagaaattgg 


acatggatct 


120 


tttggagctg 


tctattttgc 


atatgacaaa 


aaaaatgagc 


agactgttgc 


gattaaaaag 


180 


atgaatttta 


gtggaaaaca 


ggctgtcgaa 


aaatggaatg 


atattcttaa 


agaagtgtct 


240 


tttctgaata 


cagttgttca 


tccacatatt 


gtcgactaca 


aggcttgttt 


tcttaaggac 


300 


actacatgtt 


ggcttgtgat 


ggagtactgt 


attggctctg 


cagccgatat 


agtggatgtc 


360 


ttgcgaaaag 


gaatgcgaga 


agtcgaaatc 


gctgcgattt 


gctctcaaac 


tttggatgct 


420 


cttcgatatc 


ttcactctct 


gaagcgaata 


catcgagata 


ttaaagctgg 


aaatattctg 


480 


ctatctgatc 


atgctattgt 


taaactagct 


gatttcggat 


ccgcatccct 


ggtagatccg 


540 


gctcaaactt 


tcatcggaac 


gccgtttttc 


atggccccag 


aggtaattct 


ggcaatggat 


600 


gagggtcact 


acacggatcg 


tgcagatatt 


tggtcattgg 


gtatcacgtg 


tatagagctg 


660 


gccgaacgtc 


gtccaccatt 


gttcagtatg 


aatgcaatgt 


ctgccctcta 


ccatattgct 


720 


caaaatgatc 


ctccaactct 


ttctccaatt 


gacactagcg 


aacaaccgga 


atggtcgctg 


780 


gaattcgttc 


aatttataga 


caaatgtctt 


cgaaaaccag 


cagaagagcg 


aatgtcagct 


840 


gaagaatgct 


ttcgacatcc 


attcattcaa 


cggtctcgcc 


catcagacac 


aattcaggaa 


900 
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ctcattcaga gaacgaaaaa 
agaaaactca tgtatttgga 
5 tctgatgatt tagattttca 
gcgtcatctc gaagtgcttc 
ggtcttttag tgtccaccaa 

10 

tacggtaatg gaagtagttc 
caaatgctct cttctacatc 
15 ggagcatcga ttacggcgat 
acatcacaac caacatcgaa 
cctttggaca cgtcgatacg 

20 

aaggaacgaa tagccacgtt 
atcaatcagg aacaagaaga 
25 tacaagcatc tacgacaagc 
ttagatagag agcaactgcg 
tactcgaaag aaaagatgag 

30 

aaagatatcg aagatgggga 
cagatgaaac tgtattcagc 

35 aaaacacgat tacgaagtct 
gtgaaagccg atctgaatcg 
cgtgcagaac ttgaagatga 
caattggaag aacaattgga 
cgtcacggat tactgtcaaa 

45 aacgagcttc acgcgatgaa 
agtcaaaatg agtacacaca 
tcaagacaac agccaagaga 

50 

caagttgtga agactcagac 
gttccaaaag atgaacaaaa 
55 gtcgcacttc ttgcttcaca 
gtgaagctcg agtcgtggca 



tatggtatta gagttggata 
tgaaacagaa ggaaaagaag 
tggaaatgaa gctaattcaa 
tcttacttct ttccgatcaa 
tacgacgggt gctatggata 
gacgacgagc tccgcacgcc 
aacgtctggt gttggaacta 
cgcagtcaat ccaacaccgt 
atcagaatca tcttctatac 
tgctccagtg aaagacttgc 
gcaaaatcac aaattcgcga 
atatacgaaa gagaacaata 
acatcacaaa gagctccaac 
tgtgaaaatg gatcgagaac 
agtgaggtgt tcacagaata 
gaaaaagatg aaaaagacga 
gcaacaattg aaagaataca 
gaacatgcct cgaagtactt 
agtgaaagat gcacgggaaa 
aattgtaagg tatcgcaggc 
tgatgaagac gtaaacgtgc 
gcagcatgaa atgacgcgcg 
aaaacgacat ttggagacac 
gaggcaacag gatgaattga 
tttaaagatc caagaagcac 
tcgccaattt aagctctacc 
agagctcacg tctcgactaa 
atacgaaagt caaatcaaaa 
agaagatgaa caacgggttc 



attttcaata 


caaaaagatg 


Q iZ n 

you 


gaagtgaagg aaatggagca 




ttggaagagc 


aggagattct 


1080 


tgcagagtag 


tggaggagct 


1140 


atgtgcatgg 


atcctctgga 


1200 


gccgtcctcc 


aattccttcg 


1260 


tgccgagtca 


tggatcagtt 


1320 


ctccttcaga 


acctatccca 


1380 


tcgaaactgc 


acacgatgat 


1440 


atatgccgca 


tcgagcagtc 


1500 


cgcttcgttc 


ccagagaata 


i c ^ r\ 
1560 


tgtatgagca 


aatgagcaag 


1 C *5 r\ 


aatttgaaga 


acgatgtgca 


iron 
1630 


tcgaacaatt 


gacaacgaca 


1 1 A f\ 


atgaactaga 


caaacggaaa 


i q n n 
1800 


aaaatagtca 


aaatcagcag 


1 Q C f\ 


agtataacaa 


ggaggcacag 


± 7£ u 


atgagaacgc 


aatgaaagaa 


i o q n 


atgattttga 


cgagaagctt 


? ClAf\ 


aacaactcag 


taatcttcat 


ZlUU 


aagaacgcca 


aatggacacg 


2160 


atttggaaat 


acagcatctc 


2220 


aacacgaggc 


ggaatcggca 


2280 


gaaaaaagca 


tgcgatgcag 


2340 


aaattcgaaa 


acaataccga 


2400 


ttacacaaat 


ggtgcaagta 


2460 


aacaggatca 


aatgcaaaaa 


2520 


aaatggttca ggataagaca 


2580 


ttagtgagaa 


gttggagaaa 


2640 
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gaattggaag aattgattgc ttatcagaag aagacgagag ccacattaga agagcagatt 2700 

aaaaaggaac gtacggcact cgaagaacga attggcacac gacgtgcaat gcttgaacag 2760 

5 

aagattattg aagaacgcga acaaatggga gaaatgcgtc gactaaagaa ggagcaaatc 2820 

cgtgatcgac acagtcaaga acgccatcgt ctcgagaatc atttcgtacg gacgggctcg 2880 

10 acgagcagaa gttctggtgg gatcgctcct ggtgttggga attcaagcag tattcagatg 2940 

gctatgtag 2949 

15 <210> 2 

<211> 982 
<212> PRT 

20 

<213> Caenorhabditis elegans 
<400> 2 

25 Met Ala Pro Ala Val Leu Gin Lys Pro Gly Val He Lys Asp Pro Ser 
15 10 15 

He Ala Ala Leu Phe Ser Asn Lys Asp Pro Glu Gin Arg Tyr Gin Asp 
20 25 30 

Leu Arg Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Tyr 
35 40 45 



30 



Asp Lys Lys Asn Glu Gin Thr Val Ala He Lys Lys Met Asn Phe Ser 
35 50 55 60 

Gly Lys Gin Ala Val Glu Lys Trp Asn Asp He Leu Lys Glu Val Ser 
65 70 75 80 

40 Phe Leu Asn Thr Val Val His Pro His He Val Asp Tyr Lys Ala Cys 

85 90 95 

Phe Leu Lys Asp Thr Thr Cys Trp Leu Val Met Glu Tyr Cys He Gly 
100 105 110 



45 



Ser Ala Ala Asp He Val Asp Val Leu Arg Lys Gly Met Arg Glu Val 
115 120 125 



Glu He Ala Ala He Cys Ser Gin Thr Leu Asp Ala Leu Arg Tyr Leu 
50 130 135 140 

His Ser Leu Lys Arg He His Arg Asp He Lys Ala Gly Asn He Leu 

145 150 155 160 

55 Leu Ser Asp His Ala He Val Lys Leu Ala Asp Phe Gly Ser Ala Ser 

165 170 175 
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Leu Val Asp Pro Ala Gin Thr Phe lie Gly Thr Pro Phe Phe Met Ala 
180 185 190 

Pro Glu Val lie Leu Ala Met Asp Glu Gly His Tyr Thr Asp Arg Ala 
5 195 200 205 

Asp lie Trp Ser Leu Gly lie Thr Cys lie Glu Leu Ala Glu Arg Arg 
210 215 220 

10 Pro Pro Leu Phe Ser Met Asn Ala Met Ser Ala Leu Tyr His He Ala 
225 230 235 240 

Gin Asn Asp Pro Pro Thr Leu Ser Pro He Asp Thr Ser Glu Gin Pro 
245 250 255 

15 

Glu Trp Ser Leu Glu Phe Val Gin Phe He Asp Lys Cys Leu Arg Lys 
260 265 270 

Pro Ala Glu Glu Arg Met Ser Ala Glu Glu Cys Phe Arg His Pro Phe 
20 275 280 285 

He Gin Arg Ser Arg Pro Ser Asp Thr He Gin Glu Leu He Gin Arg 
290 295 300 

25 Thr Lys Asn Met Val Leu Glu Leu Asp Asn Phe Gin Tyr Lys Lys Met 
305 310 315 320 

Arg Lys Leu Met Tyr Leu Asp Glu Thr Glu Gly Lys Glu Gly Ser Glu 
325 330 335 

30 

Gly Asn Gly Ala Ser Asp Asp Leu Asp Phe His Gly Asn Glu Ala Asn 
340 345 350 

Ser He Gly Arg Ala Gly Asp Ser Ala Ser Ser Arg Ser Ala Ser Leu 
35 355 360 365 

Thr Ser Phe Arg Ser Met Gin Ser Ser Gly Gly Ala Gly Leu Leu Val 
370 375 380 

40 Ser Thr Asn Thr Thr Gly Ala Met Asp Asn Val His Gly Ser Ser Gly 
385 390 395 400 

Tyr Gly Asn Gly Ser Ser Ser Thr Thr Ser Ser Ala Arg Arg Arg Pro 
405 410 415 

45 

Pro He Pro Ser Gin Met Leu Ser Ser Thr Ser Thr Ser Gly Val Gly 
420 425 430 

Thr Met Pro Ser His Gly Ser Val Gly Ala Ser He Thr Ala He Ala 
50 435 440 445 

Val Asn Pro Thr Pro Ser Pro Ser Glu Pro He Pro Thr Ser Gin Pro 
450 455 460 

55 Thr Ser Lys Ser Glu Ser Ser Ser He Leu Glu Thr Ala His Asp Asp 
465 470 475 480 
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Pro Leu Asp Thr Ser lie Arg Ala Pro Val Lys Asp Leu His Met Pro 

.485 490 495 

His Arg Ala Val Lys Glu Arg lie Ala Thr Leu Gin Asn His Lys Phe 
5 ~* 500 505 510 

Ala Thr Leu Arg Ser Gin Arg lie He Asn Gin Glu Gin Glu Glu Tyr 

515 520 525 

10 Thr Lys Glu Asn Asn Met Tyr Glu Gin Met Ser Lys Tyr Lys His Leu 
530 535 540 

Arg Gin Ala His His Lys Glu Leu Gin Gin Phe Glu Glu Arg Cys Ala 
545 550 555 560 

Leu Asp Arg Glu Gin Leu Arg Val Lys Met Asp Arg Glu Leu Glu Gin 

565 570 575 



15 



Leu Thr Thr Thr Tyr Ser Lys Glu Lys Met Arg Val Arg Cys Ser Gin 
20 580 585 590 

Asn Asn Glu Leu Asp Lys Arg Lys Lys Asp He Glu Asp Gly Glu Lys 
595 600 605 

25 Lys Met Lys Lys Thr Lys Asn Ser Gin Asn Gin Gin Gin Met Lys Leu 

610 615 620 



30 



Tyr Ser Ala Gin Gin Leu Lys Glu Tyr Lys Tyr Asn Lys Glu Ala Gin 

625 630 635 640 

Lys Thr Arg Leu Arg Ser Leu Asn Met Pro Arg Ser Thr Tyr Glu Asn 

645 650 655 



Ala Met Lys Glu Val Lys Ala Asp Leu Asn Arg Val Lys Asp Ala Arg 
35 660 665 670 

Glu Asn Asp Phe Asp Glu Lys Leu Arg Ala Glu Leu Glu Asp Glu He 
675 680 685 

40 Val Arg Tyr Arg Arg Gin Gin Leu Ser Asn Leu His Gin Leu Glu Glu 
690 " 695 700 

Gin Leu Asp Asp Glu Asp Val Asn Val Gin Glu Arg Gin Met Asp Thr 
705 " 710 715 720 

Arg His Gly Leu Leu Ser Lys Gin His Glu Met Thr Arg Asp Leu Glu 
725 730 735 

He Gin His Leu Asn Glu Leu His Ala Met Lys Lys Arg His Leu Glu 
50 740 745 750 

Thr Gin His Glu Ala Glu Ser Ala Ser Gin Asn Glu Tyr Thr Gin Arg 
755 760 765 

55 Gin Gin Asp Glu Leu Arg Lys Lys His Ala Met Gin Ser Arg Gin Gin 
770 775 780 
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Pro Arg Asp Leu Lys lie Gin Glu Ala Gin He Arg Lys Gin Tyr Arg 
785 790 795 800 

Gin Val Val Lys Thr Gin Thr Arg Gin Phe Lys Leu Tyr Leu Thr Gin 
5 805 810 815 

Met Val Gin Val Val Pro Lys Asp Glu Gin Lys Glu Leu Thr Ser Arg 
820 825 830 

10 Leu Lys Gin Asp Gin Met Gin Lys Val Ala Leu Leu Ala Ser Gin Tyr 
835 840 845 

Glu Ser Gin He Lys Lys Met Val Gin Asp Lys Thr Val Lys Leu Glu 
850 855 860 

15 

Ser Trp Gin Glu Asp Glu Gin Arg Val Leu Ser Glu Lys Leu Glu Lys 
865 870 875 880 

Glu Leu Glu Glu Leu He Ala Tyr Gin Lys Lys Thr Arg Ala Thr Leu 
20 885 890 895 

Glu Glu Gin He Lys Lys Glu Arg Thr Ala Leu Glu Glu Arg He Gly 
900 905 910 

25 Thr Arg Arg Ala Met Leu Glu Gin Lys lie He Glu Glu Arg Glu Gin 
915 920 925 

Met Gly Glu Met Arg Arg Leu Lys Lys Glu Gin He Arg Asp Arg His 
930 935 940 

Ser Gin Glu Arg His Arg Leu Glu Asn His Phe Val Arg Thr Gly Ser 
945 ^ 950 955 960 



30 



Thr Ser Arg Ser Ser Gly Gly He Ala Pro Gly Val Gly Asn Ser Ser 
35 ~ 965 970 975 

Ser He Gin Met Ala Met 
980 

40 
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<300> 

<308> genbank NM_016281 
<309> 2002-10-04 
<313> (1) . . (4188) 
<400> 3 



gccggggaac 


aagccacagg 


agagcgactc 


aggaacaagt 


gtgggagagg 


aageggegge 


D u 


ggcggcgccg 


ggcccggggg 


tggtgacagc 


aggtctgagg 


ttgeatcata 


aatacaaagg 


1 on 


actgaagtta 


taaaagagaa 


aagagaagtt 


tgctgct aaa 


atgaatctga 


y caa u auyy a 


i ft n 


atattttgtg 


ccacacacaa 


aaaggtactg 


aagat ttacc 


ccccaaaaaa 


n a ♦* ^ a a 
ddL Ly LUaaL 


24 0 


gagaaataaa 


gctaactgat 


atcaaaaagc 


agagcct get 


ccaccggccd 




3 00 


aggggtgctg 


aaggacccag 


agattgccga 


tctattctac 




t-Lyayyaao u 


3 60 


ttttattggt 


ttgcatgaaa 


ttggacatgg 


aagttttgga 


geagee cat c 




420 


tgctcacacc 


aatgaggtgg 


tggcaattaa 


gaagatgtcc 


tatageggga 


a r"*^* a <"t a r* r* Sk 


480 


tgagaaatgg 


caagatattc 


ttaaggaagt 


taaatttt ca 


cgacaat t ga 


ayCotCCtaa 


54 0 


tactattgag 


tacaaaggct 


gttacttgaa 


agaacacact 


get tggcegg 


tyauyy aaua 


600 


ttgcttaggc 


tcagcctctg 


atttattaga 


agttcataaa 


aaaccacttc 


ayyacty uyya 


660 


gatcgctgcc 


attactcatg 


gagccttgca 


tggactagee 


tacctacat t 


CtCatytoLL 


72 0 


gattcatagg 


gatattaaag 


caggaaatat 


tcttctaaca 


gagecaggtc 


aggtaaaact 


-7 q n 


agctgatttt 


ggatctgctt 


caatggcttc 


tcctgccaac 


tccttcgtgg 


gcacacctta 


840 


ctggatggct 


ccagaggtga 


tcttagctat 


ggatgaagga 


cagtatgatg 


ggaaagttga 


900 


tatttggtca 


cttggcatca 


cttgtattga 


attggcggaa 


cggaagccgc 


cccttttcaa 


960 


catgaatgca 


atgagtgcct 


tatatcacat 


tgcccagaat 


gactccccaa 


cgttacagtc 


1020 


taatgaatgg 


acagactcct 


ttaggagatt 


tgttgattac 


tgettgeaga 


aaatacctca 


1080 


ggaaaggcca 


acatcagcag 


aactattaag 


gcatgacttt 


gttcgacgag 


accggccact 


1140 


acgtgtcctc 


attgacctca 


tacagaggac 


aaaagatgea 


gttcgtgagc 


tagataacct 


1200 


acagtaccga 


aaaatgaaaa 


aaatactttt 


ccaagagaca 


eggaatggae 


ccttgaatga 


1260 


gtcacaggag 


gatgaggaag 


acagtgaaca 


tggaaccagc 


ctgaacaggg 


aaatggacag 


1320 


cctgggcagc 


aaccattcca 


ttccaagcat 


gtccgtgagc 


acaggcagcc 


agagcagcag 


1380 


tgtgaacagc 


atgcaggaag 


tcatggacga 


gagcagttcc 


gaacttgtca 


tgatgeaega 


1440 



WO 2004/056982 

tgacgaaagc acaatcaatt ccagctcctc 
aagggatgag gcgggccacg gcgatcccag 

5 

gagccaggcc ctccactacc ggaacagaga 
ggttacacga cagatccatg agcatgagca 
10 ttataagcgg atgcggcgcc agcaccagaa 
ggctgagatg gacgagcacc gcctcaagct 
' ctcgtccatc gagctggaga agctggccaa 

15 

aaaggtagct gcagcagatg agaagaagtt 
agatttgaca actttcttag aaagtcagaa 
20 aaaagaggaa atgaatgagg accatagcac 
caaacataaa gagaacttgc agcacacaca 
acagagactg tactacgaca aaaattgtcg 

25 

gcacgaggtg gagcagcaga acattcggga 
gatggagcat gccatgctaa tccggcacga 
30 gctgcacacg ttacagaagc tacgcatgga 
ggaaaaccag ctggagtaca ataagaggcg 
ggaacttcgg caacagccaa aaaacttaaa 

35 

tcaggacact tgcaaagtac agaccaaaca 
agttactcca aagaatgagc acaaaacaat 
40 aaaacttgcc attttggcag agcagtatga 
agcgttacgg ctagatgagg ctcaagaagc 
gcaggaaatg gagctgctca acgcctacca 

45 

acatgaacgt gagctccaga agctagagca 
gcagaagatt gaagaggagc tggctgccct 
50 cctattggaa aggcaagagc gagagattga 
atttgggaat ttggttacat tagattttcc 
ttgccattta caaaaaaaaa aaaaaaaaga 

55 

cacattcccc attttaacgg gcgttgctct 
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cgtcgtgcat 


aagaaagatc 


atgtattcat 


1500 


gcctgagccg 


cggcctaccc 


agtcagttca 


1560 


gcgctttgcc 


acgatcaaat 


cagcatcttt 


1620 


ggagaacgag 


ttgcgggaac 


agatgtcagg 


1680 


gcagctgatc 


gccctggaga 


acaagctgaa 


1740 


acagaaggag 


gtggagacgc 


atgccaacaa 


1800 


gaagcaagtg 


gctatcatag 


aaaaggaggc 


1860 


ccagcaacag 


atcttggccc 


agcagaagaa 


1920 


gaagcagtat 


aagatttgta 


aggaaaaaat 


1980 


acccaagaaa 


gagaagcaag 


agcggatctc 


2040 


ggctgaagag 


gaagcccacc 


ttctcactca 


2100 


tttcttcaag 


cggaaaataa 


tgatcaagcg 


2160 


ggaactaaat 


aaaaagagga 


cccagaagga 


2220 


cgagtccacc 


cgagagctag agtacaggca 


2280 


tctgatccgt 


ttacagcacc 


agacggaact 


2340 


agaaagagaa 


ctgcacagaa 


agcatgtcat 


2400 


ggccatggaa 


atgcaaatta 


aaaaacagtt 


2460 


gtataaagca 


ctcaagaatc 


accagttgga 


2520 


cttaaagaca 


ctgaaagatg 


agcagacaag 


2580 


acagagtata 


aatgaaatga 


tggcctctca 


2640 


agaatgccag 


gccttgaggc 


tacagctcca 


2700 




aagatgcaaa 


cagaggcaca 


2760 


gagagtgtct 


ctgcgcagag 


cacaccttga 


2820 


tcagaaggaa 


cgcagcgaga 


gaataaagaa 


2880 


aacttttgac 


atggagagcc 


tcagaatggg 


2940 


taaggaggac 


tacagatgag 


attaaatttt 


3000 


aaacagaaaa 


aaattcagac 


cctgcaaaac 


3060 


cactctctct 


ctctcttact 


cttactgaca 


3120 
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tcgtgtcgga 


ctagtgcctg 


tttattctta 


ctccatcagg ggcccccttc 


ctccccccgt 


3180 




gtcaactttc 


agtgctggcc 


aaaacctggc 


cgtctcttct 


attcacagta 


cacgtcacag 


3240 


5 


tattgatgtg 


attcaaaatg 


tttcagtgaa 


aactttggag 


acagt tttaa 


caaaaccaat 


3300 




aaaccaacaa 


caaaaaaagt 


ggatgtatat 


tgc tttaagc 


aatcactcat 


taccaccaat 


3360 


10 


ctgtgaaagt 


aaagcaaaaa 


ataataataa 


taaatgccaa 


qqqqqaq aq a 


gacacaatat 


3420 


ccgcagcctt 


acaccttaac 


tagctgctgc 


attattttat 


tttattttat 


ttttttggta 


3480 




tttattcatc 


aggaataaaa 


aaaacaaagt 


t ttat taaag 


attgaaaatt 


tgatacattt 


3540 


15 


tacagaaact 


aattgtgatg 


tacatatcag 


tggtgacata. 


ttattacttt 


tttggggacg 


3600 




ggggtgggtg 


gggtgaagag 


atcttgtgat 


tt tagactgc 


tgcagagtta 


acttgtctca 


3660 


20 


gcatatctga 


tgtatcataa 


tcatttctgc 


tqtqcaqaqq 


agggatacac 


ttaggggctc 


3720 


acagatccca 


gtagcacaat 


tgggctttgg 


caaatgggta 


ttttgtgtat 


agaggaattt 


3780 




aaggagaggt 


attacttatt 


ttcatattgt 


attttaactg 


tttctcggat 


caaatttttt 


3840 


25 


aacttcttct 


tcgtgttctt 


ccccacctcc 


ttccttttcc 


agttcagtat 


ttggagttca 


3900 




acactgtctc 


tcaatcagat 


catctggatc 


tttttcttta 


tctcccttcc 


ccttcctaag 


3960 


30 


tcccatttct 


tggtcataaa 


tattgcatta 


ttcacacttt 


caaactgtgt 


attttcttac 


4020 


aataaaaaat 


gatgaaaaaa 


aaaaaggctt 


tacttctttt 


gcatgcactt 


taaaaacaaa 


4080 




acaaaacatt 


tttcaggttc 


caaggaagag 


catgataact 


gtcagagctt 


ttaattatat 


4140 


35 


ttgtaaataa 


aagtgttcat 


cacaaaaaaa 


aaaaaaaaaa 


aaaaaaaa 




4188 



<210> 4 

40 <211> 898 

<212> PRT 

<213> Homo sapiens 



45 



50 



55 



<300> 

<308> Genbank NP_057365.2 

<309> 2002-10-01 

<313> (1) . . (898) 

<400> 4 



WO 2004/056982 PCT/EP2003/0 14674 

53 



Met Arg Lys Gly Val Leu Lys Asp Pro Glu He Ala Asp Leu Phe Tyr 
1 5 10 15 

Lys Asp Asp Pro Glu Glu Leu Phe He Gly Leu His Glu lie Gly His 
5 20 25 30 

Gly Ser Phe Gly Ala Val Tyr Phe Ala Thr Asn Ala His Thr Asn Glu 
35 40 45 

10 Val val Ala He Lys Lys Met Ser Tyr Ser Gly Lys Gin Thr His Glu 
50 55 60 

Lys Trp Gin Asp He Leu Lys Glu Val Lys Phe Leu Arg Gin Leu Lys 
65 70 75 80 

15 

His Pro Asn Thr He Glu Tyr Lys Gly Cys Tyr Leu Lys Glu His Thr 
85 90 95 

Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala Ser Asp Leu Leu 
20 100 105 HO 

Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He Ala Ala He Thr 
115 120 125 

25 His Gly Ala Leu His Gly Leu Ala Tyr Leu His Ser His Ala Leu He 
130 135 140 

His Arg Asp lie Lys Ala Gly Asn He Leu Leu Thr Glu Pro Gly Gin 
145 ~* * 150 155 160 

Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met Ala Ser Pro Ala Asn 
165 170 175 



30 



Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val lie Leu Ala 

35 180 185 190 

Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp lie Trp Ser Leu Gly 
195 200 205 

40 lie Thr Cys lie Glu Leu Ala Glu Arg Lys Pro Pro Leu Phe Asn Met 
210 215 220 



45 



Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn Asp Ser Pro Thr 
225 230 235 240 

Leu Gin Ser Asn Glu Trp Thr Asp Ser Phe Arg Arg Phe Val Asp Tyr 
245 250 255 



Cys Leu Gin Lys lie Pro Gin Glu Arg Pro Thr Ser Ala Glu Leu Leu 

50 260 265 270 

Arg His Asp Phe Val Arg Arg Asp Arg Pro Leu Arg Val Leu lie Asp 

275 280 285 

55 Leu He Gin Arg Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin 
290 295 300 
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Tyr Arg Lys Met Lys Lys lie Leu Phe Gin Glu Thr Arg Asn Gly Pro 
305 ~ 310 315 320 

Leu Asn Glu Ser Gin Glu Asp Glu Glu Asp Ser Glu His Gly Thr Ser 
5 325 330 335 

Leu Asn Arg Glu Met Asp Ser Leu Gly Ser Asn His Ser lie Pro Ser 
340 345 350 

10 Met Ser Val Ser Thr Gly Ser Gin Ser Ser Ser Val Asn Ser Met Gin 
355 360 365 

Glu Val Met Asp Glu Ser Ser Ser Glu Leu Val Met Met His Asp Asp 
370 375 380 

15 

Glu Ser Thr lie Asn Ser Ser Ser Ser Val Val His Lys Lys Asp His 
385 390 395 400 

Val Phe lie Arg Asp Glu Ala Gly His Gly Asp Pro Arg Pro Glu Pro 
20 405 410 415 

Arg Pro Thr Gin Ser Val Gin Ser Gin Ala Leu His Tyr Arg Asn Arg 
420 425 430 

25 Glu Arg Phe Ala Thr lie Lys Ser Ala Ser Leu Val Thr Arg Gin lie 
435 440 445 

His Glu His Glu Gin Glu Asn Glu Leu Arg Glu Gin Met Ser Gly Tyr 
450 455 460 

30 

Lys Arg Met Arg Arg Gin His Gin Lys Gin Leu lie Ala Leu Glu Asn 
465 470 475 480 

Lys Leu Lys Ala Glu Met Asp Glu His Arg Leu Lys Leu Gin Lys Glu 
35 485 490 495 

Val Glu Thr His Ala Asn Asn Ser Ser lie Glu Leu Glu Lys Leu Ala 
500 505 510 

40 Lys Lys Gin Val Ala lie lie Glu Lys Glu Ala Lys Val Ala Ala Ala 
515 520 525 

Asp Glu Lys Lys Phe Gin Gin Gin lie Leu Ala Gin Gin Lys Lys Asp 
530 535 540 

45 

Leu Thr Thr Phe Leu Glu Ser Gin Lys Lys Gin Tyr Lys lie Cys Lys 
545 550 555 560 

Glu Lys lie Lys Glu Glu Met Asn Glu Asp His Ser Thr Pro Lys Lys 
50 565 570 575 

Glu Lys Gin Glu Arg lie Ser Lys His Lys Glu Asn Leu Gin His Thr 
580 585 590 

55 Gin Ala Glu Glu Glu Ala His Leu Leu Thr Gin Gin Arg Leu Tyr Tyr 
595 600 605 
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Asp Lys Asn Cys Arg Phe Phe Lys Arg Lys lie Met lie Lys Arg His 
610 615 620 

Glu Val Glu Gin Gin Asn He Arg Glu Glu Leu Asn Lys Lys Arg Thr 
5 625 630 635 640 

Gin Lys Glu Met Glu His Ala Met Leu He Arg His Asp Glu Ser Thr 
645 650 655 

10 Arg Glu Leu Glu Tyr Arg Gin Leu His Thr Leu Gin Lys Leu Arg Met 
660 665 670 

Asp Leu He Arg Leu Gin His Gin Thr Glu Leu Glu Asn Gin Leu Glu 
675 680 685 

15 

Tyr Asn Lys Arg Arg Glu Arg Glu Leu His Arg Lys His Val Met Glu 
690 695 7 00 

Leu Arg Gin Gin Pro Lys Asn Leu Lys Ala Met Glu Met Gin He Lys 
20 705 710 715 720 

Lys Gin Phe Gin Asp Thr Cys Lys Val Gin Thr Lys Gin Tyr Lys Ala 
725 730 735 

25 Leu Lys Asn His Gin Leu Glu Val Thr Pro Lys Asn Glu His Lys Thr 
740 745 750 

He Leu Lys Thr Leu Lys Asp Glu Gin Thr Arg Lys Leu Ala He Leu 
755 760 765 

Ala Glu Gin Tyr Glu Gin Ser He Asn Glu Met Met Ala Ser Gin Ala 
770 775 780 



30 



Leu Arg Leu Asp Glu Ala Gin Glu Ala Glu Cys Gin Ala Leu Arg Leu 
35 785 790 795 800 

Gin Leu Gin Gin Glu Met Glu Leu Leu Asn Ala Tyr Gin Ser Lys He 
805 810 815 

40 Lys Met Gin Thr Glu Ala Gin His Glu Arg Glu Leu Gin Lys Leu Glu 
820 825 830 

Gin Arg Val Ser Leu Arg Arg Ala His Leu Glu Gin Lys He Glu Glu 
835 840 845 

45 

Glu Leu Ala Ala Leu Gin Lys Glu Arg Ser Glu Arg He Lys Asn Leu 
850 855 860 

Leu Glu Arg Gin Glu Arg Glu He Glu Thr Phe Asp Met Glu Ser Leu 
50 865 " 870 875 880 

Arg Met Gly Phe Gly Asn Leu Val Thr Leu Asp Phe Pro Lys Glu Asp 
885 890 895 



55 Tyr Arg 
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<210> 5 
<211> 4971 

5 

<212> DNA 

<213> Homo sapiens 

10 

<300> 

<308> Genbank NM_016151.1 
15 <3Q9> 2002-11-05 
<313> (1) . . (4971) 



20 


<400> 5 
aattcggcac 


gagctgagac 


ggagaagagg 


agaggcagag agggcgcggg 


gaccgtcagc 


60 




agcaccttag ctacaatcgt 


tcagctattc 


tcggaagaga gaagggagag 


ggaggaggcc 


120 


25 


ggggcgggag 


tgggggctgt 


caccctcgga 


ccccggcgtg agaggggccg 


tgcggccgga 


180 




cgtcctcggg 


gtgggccccc 


agtcggtggc 


cgaagaccta cagctcaggc 


ccctgggtcc 


240 


30 


caaatttcca 


ggctttgccc 


ctcctccttt 


ctcagatacc cgggtaacag 


tcctcatagt 


300 


ccagatatcc 


gggactcggg 


tcccaacctc 


tctaaacctg ggtctctgtt 


tcatagattt 


360 




tcaaatatca 


ggttcaggcc 


cctgcgtgca 


ccagtatccg gggttcattc 


cccgggcgtt 


420 


35 


tcaaatatcg 


gattcagtct 


ccatcccgtt 


cagatattcg gggttcagac 


cccacaatca 


480 




gaaatccgga 


attcggcagc 


tgtcgccctc 


gacgaggggg aggactggac 


cgcgaggtca 


540 


40 


gattaggttg 


tcaccccctc 


ccctccaggg 


gaggcttccc gggcccgccc 


ctcaggaagg 


600 


gcgaaagccg 


aggaagaggt 


ggcaagggga 


aaggtctcct tgcccctctc 


cctgcttggc 


660 




agagccgctg 


gaggacccca 


ggcggaagcg 


gaggcgctgg ggcaccatag 


tgacccctac 


720 


45 


caggccaggc 


cccactctca 


gggcccccag 


gggccaccat gccagctggg 


ggccgggccg 


780 




ggagcctgaa 


ggacccagat 


gtggctgagc 


tcttcttcaa ggatgaccca 


gaaaagctct 


840 


50 


tctctgacct 


ccgggaaatt 


ggccatggca 


gctttggagc cgtatacttt 


gcccgggatg 


900 


tccggaatag 


tgaggtggtg 


gccatcaaga 


agatgtccta cagtgggaag 


cagtccaatg 


960 




agaaatggca 


agacatcatc 


aaggaggtgc 


ggttcttaca gaagctccgg 


catcccaaca 


1020 


55 


ccattcagta 


ccggggctgt 


tacctgaggg 


agcacacggc ttggctggta 


atggagtatt 


1080 




gcctgggctc 


agcttctgac 


cttctagaag 


tgcacaagaa accccttcag 


gaggtagaga 


1140 
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tcgcagctgt gacccacggg gcgcttcagg 
tccataggga tgtgaaggct ggaaacatcc 

5 

gggactttgg ttctgcgtcc atcatggcac 
ggatggcacc cgaggtgatc ctggccatgg 
10 tctggtcctt ggggataacc tgcatcgagc 
tgaatgcgat gagtgcctta taccacattg 
gacactggtc tgagtacttc cggaattttg 

15 

acagaccaac ctcagaggtt ctcctgaagc 
cagtcatcat ggacctgatc cagaggacca 
20 agtaccgcaa gatgaagaag atcctgttcc 
ccccagagga ggaagaggag gccgagccct 
tcgagagtag ccactcagtg cccagcatgt 

25 

tcaacagcct agcagatgcc tcagacaacg 
aggaggagga agaaggccct gaagcccggg 
30 cagtcacctc tcacagctcc attatccacc 
acccctacca gccagagata acccccagcc 
cttccaccac ctcttccgcc cgccgccggg 

35 

ccatccgaac cgcctccctg gtcagccgtc 
tgcgggagca gctgagcggc tataagcgga 
40 ccctggagtc acggctgagg ggtgaacggg 
ttgaggcgca gcgggctggc tttggggcag 
ccataggtga gaaggaggca cgagctgccc 

45 

tccttgggca gcagaagaag gagctggctg 
aacttcgcaa ggaacagctg aaggaggagc 
50 agaaggccga gtggctgctg cggcagaagg 
aagcagggct gctgcggcgg cagcgccagt 
gcaagatgtt gctggctcgg cacagcctgg 

55 

agaagcagac ccagaaggac ttggagtgtg 



gcctggcata 


tctgcactcc 


cacaacatga 


1200 


tgctgtcaga 


gccagggtta 


gtgaagctag 


1260 


ctgccaactc 


cttcgtgggc 


accccatact 


1320 


atgaggggca 


gtacgatggc 


aaagtggacg 


1380 


tggctgaacg 


gaaaccaccg 


ctctttaaca 


1440 


cacagaacga 


atcccccgtg 


ctccagtcag 


1500 


tcgactcctg 


tcttcagaaa 


atccctcaag 


1560 


accgctttgt 


gctccgggag 


cggccaccca 


1620 


aggatgccgt 


gcgggagctg 


gacaacctgc 


1680 


aagaggcacc 


caacggccct 


ggtgccgagg 


1740 


acatgcaccg ggccgggact 


ctgaccagcc 


1800 


ccatcagcgc 


ctccagccag 


agcagctccg 


1860 


aggaagagga 


ggaggaggag 


gaggaagagg 


1920 


agatggccat 


gatgcaggag 


ggggagcaca 


1980 


ggctgccggg 


ctctgacaac 


ctatatgatg 


2040 


ctctccagcc 


gcctgcagcc 


ccagctccca 


2100 


cctactgccg 


taaccgagac 


cactttgcca 


2160 


agatccagga 


gcatgagcag 


gactctgcgc 


2220 


tgcgacgaca 


gcaccagaag 


cagctgctgg 


2280 


aggagcacag 


tgcacggctg 


cagcgggagc 


2340 


aggcagaaaa 


gctggcccgg 


cggcaccagg 


2400 


aggccgagga 


gcggaagttc 


cagcagcaca 


2460 


ccctgctgga 


ggcacagaag 


cggacctaca 


2520 


tccaggagaa 


ccccagcact 


cccaagcggg 


2580 


agcagctcca 


gcagtgccag 


gcggaggagg 


2640 


actttgagct 


gcagtgtcgc 


cagtacaagc 


2700 


accaggacct 


gctgcgggag 


gacctgaaca 


2760 


cactgctgct 


tcggcagcac 


gaggccacgc 


2820 
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gggagctgga gctgcggcag ctccaggccg 
tgcagcacca gacggagctg ggcaaccagc 
5 tgcggcagaa gcatgcggcc caggttcgcc 
gccagcgccc cccgggcctt ccactcccca 
gcacccctat agaacagcag ccctgctcac 

10 

tgcttggcga ggaggaggaa gcagttggag 
ctttggagcc caagcagcag aggattctgg 
15 cacaaaaaca tgggagcctg gttgatgagg 
agcttagggt gccctccctt gtaccccagg 
ggacgtggag cttgtggggg aaggaggatg 

20 

gctgggtcca gggcccagca ctgactcccg 
gggctccgat tgggacccct agggatcctg 
25 ctgaaccccc tccaacacac ctgaggccct 
cccatggcct cctggccggc ctctcctttg 
tcctgctgct gctgctgctt ccattgctgg 

30 

cgctgctggc ccttgaggtg gggctggtgg 
cagccctgca cctgccctcc agtcttttcc 
35 ccgtcctggg cctgagctgg cgccgaggcc 
cctggctctt agcttggcca ggcctagctc 
gatgggtgcg gcagcagggc ccccgggtgc 

40 

ttctgctgcg cctgtcaccc atggccttcc 
accggggtct gtttgcactg taccccaaaa 
45 ccgtccctgg gccccggcgg cgtaatcccc 
caagggtctg ggtcctgtgc aagggctgga 
tagcatccca cttgcccccg tgggccatcc 

50 

gtgaacggcc cacccgaatc ccccggctac 
ctgcctccca ccagccactg ccagggactc 
55 cccgggccct gcccccctgg aggtagctga 
ttgagcactt tatctcccac gactcagtga 



tgcagcgcac gcgggctgag 


ctcacccgcc 


2880 


tggagtacaa 


caaqcqqcgt 


gagcaagagt 


2940 


agcagcccaa 


gagcctcaaa 


gtacgtgcag 


3000 


ttcctqqqqc 


tctgggccca 


cccaacacag 


3060 


CtQQCCaQQa 


qqcaqt CCtq 

33 x ""*3 3 


gaccaaagaa 


3120 


aqaqaaqqat 


tctqqqaaaq 


gaaggggcca 


3180 


gggaagaat c 


aqqaqcccct 


agtcccagtc 


3240 


aaqt ttqqqq 


tctgcctgag 


gagatagagg 


3300 


aaaaoaacat 


tqt tqqccaq 


gaggaggctg 


3360 


agagt cttct 


qqatqaqqaq 

33»"3»33»3 


tttgagcttg 


3420 


tccctqaqqa 


qqaqqaaqaa 

33 v *33 MUl 3"" 


gaggaagagg 


3480 


qaqa tqq t tq 

^3 -J J^3 3 


tcct tccccc 


gacatccctc 


3540 


qccctqccaq 


ccagctccct 


ggactcctgt 


3600 


caqtqqqqtC 

-3 3 333 


ctcctctggc 


ctcctgcccc 


3660 


caqcccagqq 

3 3J3-3 


tgggggtggc 


ctgcaggcag 


3720 


qtctqqqqqc 


ctcctacctg 


ctcctttgta 


3780 


tactcctggc 


ccagggtacc 


gcactggggg 


3840 


tcatqqqtqt 


tcccctgggc 


cttggagctg 


3900 


tacctctggt 


qqctatqqca 

33 ^3 


gcggggggca 


3960 


qccqqqqcat 


atctcgactc 


tggttgcggg 


4020 


qqqccctoca 


aaactqt qqq 

3 33 w 3 w 33 3 


gctgtggggg 


4080 


ccaacaaaaa 


tggct t ccgc 


agccgcctgc 


4140 


gcaccaccca 


acacccatta 


gctctgttgg 


4200 


actggcgtct 


ggcacgggcc 


agccagggtt 


4260 


acacactggc 


cagctggggc 


ctgcttcggg 


4320 


taccacgcag 


ccagcgccag 


ctagggcccc 


4380 


tagccgggcg 


gaggtcacgc 


acccgccagt 


4440 


ctccagccct 


tccagcccaa 


atctagagca 


4500 


agtttctcca 


gtccctagtc 


ctctcttttc 


4560 



5 
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acccaccttc ctcagtttgc tcacttaccc caggcccagc ccttcggacc tctagacagg 4620 
cagcctcctc agctgtggag tccagcagtc actctgtgtt ctcctggcgc tcctccccta 4680 
agttattgct gttcgcccgc tgtgtgtgct catcctcacc ctcattgact caggcctggg 4740 
gccaggggtg gtggagggtg ggaagagtca tgtttttttt ctcctctttg attttgtttt 4800 
10 tctgtctccc ttccaacctg tccccttccc cccaccaaaa aaagaaaaag acaaacacaa 4860 
ataaaatatc tgagcggaac tgtgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4 920 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 4 971 

15 

<210> 6 
<211> 1235 

20 

<212> PRT 

<213> Homo sapiens 

25 

<300> 

<308> Genbank NP_057235.1 
30 <309> 2002-11-05 
<313> (D..U235) 

35 <400> 6 

Met Pro Ala Gly Gly Arg Ala Gly Ser Leu Lys Asp Pro Asp Val Ala 
1 5 10 15 

40 Glu Leu Phe Phe Lys Asp Asp Pro Glu Lys Leu Phe Ser Asp Leu Arg 
20 25 30 

Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 
35 40 45 



45 



Arg Asn Ser Glu Val Val Ala He Lys Lys Met Ser Tyr Ser Gly Lys 
50 55 60 



Gin Ser Asn Glu Lys Trp Gin Asp He He Lys Glu Val Arg Phe Leu 
50 65 70 75 80 

Gin Lys Leu Arg His Pro Asn Thr He Gin Tyr Arg Gly Cys Tyr Leu 
85 90 95 

55 Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala 
100 105 110 
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Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He 
115 120 125 

Ala Ala Val Thr His Gly Ala Leu Gin Gly Leu Ala Tyr Leu His Ser 
5 130 135 140 

His Asn Met He His Arg Asp Val Lys Ala Gly Asn He Leu Leu Ser 
145 150 155 160 

10 Glu Pro Gly Leu Val Lys Leu Gly Asp Phe Gly Ser Ala Ser He Met 

165 170 175 

Ala Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
180 185 190 

15 

val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp Val 
195 200 205 

Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu Arg Lys Pro Pro 
20 210 215 220 

Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn 
225 230 235 240 

25 Glu Ser Pro Val Leu Gin Ser Gly His Trp Ser Glu Tyr Phe Arg Asn 

245 250 255 



30 



Phe Val Asp Ser Cys Leu Gin Lys He Pro Gin Asp Arg Pro Thr Ser 

260 265 270 

Glu Val Leu Leu Lys His Arg Phe Val Leu Arg Glu Arg Pro Pro Thr 

275 280 285 



Val He Met Asp Leu He Gin Arg Thr Lys Asp Ala Val Arg Glu Leu 
35 290 295 300 

Asp Asn Leu Gin Tyr Arg Lys Met Lys Lys He Leu Phe Gin Glu Ala 
305 310 315 320 

40 Pro Asn Gly Pro Gly Ala Glu Ala Pro Glu Glu Glu Glu Glu Ala Glu 

325 330 335 



45 



Pro Tyr Met His Arg Ala Gly Thr Leu Thr Ser Leu Glu Ser Ser His 

340 345 350 

Ser Val Pro Ser Met Ser He Ser Ala Ser Ser Gin Ser Ser Ser Val 
355 360 365 



Asn Ser Leu Ala Asp Ala Ser Asp Asn Glu Glu Glu Glu Glu Glu Glu 
50 370 375 380 

Glu Glu Glu Glu Glu Glu Glu Glu Gly Pro Glu Ala Arg Glu Met Ala 
385 390 395 400 

55 Met Met Gin Glu Gly Glu His Thr Val Thr Ser His Ser Ser He He 

405 410 415 
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His Arg Leu Pro Gly Ser Asp Asn Leu Tyr Asp Asp Pro Tyr Gin Pro 
420 425 430 

Glu lie Thr Pro Ser Pro Leu Gin Pro Pro Ala Ala Pro Ala Pro Thr 
5 435 440 445 

Ser Thr Thr Ser Ser Ala Arg Arg Arg Ala Tyr Cys Arg Asn Arg Asp 
450 455 460 

10 His Phe Ala Thr lie Arg Thr Ala Ser Leu Val Ser Arg Gin lie Gin 
465 470 475 480 

Glu His Glu Gin Asp Ser Ala Leu Arg Glu Gin Leu Ser Gly Tyr Lys 
485 490 495 

15 

Arg Met Arg Arg Gin His Gin Lys Gin Leu Leu Ala Leu Glu Ser Arg 
500 505 510 

Leu Arg Gly Glu Arg Glu Glu His Ser Ala Arg Leu Gin Arg Glu Leu 
20 515 520 "* 525 

Glu Ala Gin Arg Ala Gly Phe Gly Ala Glu Ala Glu Lys Leu Ala Arg 
530 535 540 

25 Arg His Gin Ala lie Gly Glu Lys Glu Ala Arg Ala Ala Gin Ala Glu 
545 550 555 560 



30 



Glu Arg Lys Phe Gin Gin His lie Leu Gly Gin Gin Lys Lys Glu Leu 
565 570 575 

Ala Ala Leu Leu Glu Ala Gin Lys Arg Thr Tyr Lys Leu Arg Lys Glu 
580 585 590 



Gin Leu Lys Glu Glu Leu Gin Glu Asn Pro Ser Thr Pro Lys Arg Glu 
35 595 600 605 

Lys Ala Glu Trp Leu Leu Arg Gin Lys Glu Gin Leu Gin Gin Cys Gin 
610 615 620 

40 Ala Glu Glu Glu Ala Gly Leu Leu Arg Arg Gin Arg Gin Tyr Phe Glu 
625 630 635 640 

Leu Gin Cys Arg Gin Tyr Lys Arg Lys Met Leu Leu Ala Arg His Ser 
645 650 655 

45 

Leu Asp Gin Asp Leu Leu Arg Glu Asp Leu Asn Lys Lys Gin Thr Gin 
660 665 670 

Lys Asp Leu Glu Cys Ala Leu Leu Leu Arg Gin His Glu Ala Thr Arg 
50 675 680 685 

Glu Leu Glu Leu Arg Gin Leu Gin Ala Val Gin Arg Thr Arg Ala Glu 
690 695 700 

55 Leu Thr Arg Leu Gin His Gin Thr Glu Leu Gly Asn Gin Leu Glu Tyr 
705 710 715 720 
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Asn Lys Arg Arg Glu Gin Glu Leu Arg Gin Lys His Ala Ala Gin Val 
725 730 735 

Arg Gin Gin Pro Lys Ser Leu Lys Val Arg Ala Gly Gin Arg Pro Pro 
5 740 745 750 

Gly Leu Pro Leu Pro lie Pro Gly Ala Leu Gly Pro Pro Asn Thr Gly 
755 760 765 

10 Thr Pro He Glu Gin Gin Pro Cys Ser Pro Gly Gin Glu Ala Val Leu 
770 775 780 

Asp Gin Arg Met Leu Gly Glu Glu Glu Glu Ala Val Gly Glu Arg Arg 
785 790 795 800 

15 

He Leu Gly Lys Glu Gly Ala Thr Leu Glu Pro Lys Gin Gin Arg He 
805 810 815 

Leu Gly Glu Glu Ser Gly Ala Pro Ser Pro Ser Pro Gin Lys His Gly 
20 820 825 830 

Ser Leu Val Asp Glu Glu Val Trp Gly Leu Pro Glu Glu He Glu Glu 
835 840 845 

25 Leu Arg Val Pro Ser Leu Val Pro Gin Glu Arg Ser He Val Gly Gin 
850 855 860 

Glu Glu Ala Gly Thr Trp Ser Leu Trp Gly Lys Glu Asp Glu Ser Leu 
865 870 875 880 

30 

Leu Asp Glu Glu Phe Glu Leu Gly Trp Val Gin Gly Pro Ala Leu Thr 
885 890 895 

Pro Val Pro Glu Glu Glu Glu Glu Glu Glu Glu Gly Ala Pro He Gly 
35 900 905 910 

Thr Pro Arg Asp Pro Gly Asp Gly Cys Pro Ser Pro Asp He Pro Pro 
915 920 925 

40 Glu Pro Pro Pro Thr His Leu Arg Pro Cys Pro Ala Ser Gin Leu Pro 
930 935 940 

Gly Leu Leu Ser His Gly Leu Leu Ala Gly Leu Ser Phe Ala val Gly 
945 950 955 960 

45 

Ser Ser Ser Gly Leu Leu Pro Leu Leu Leu Leu Leu Leu Leu Pro Leu 
965 970 975 

Leu Ala Ala Gin Gly Gly Gly Gly Leu Gin Ala Ala Leu Leu Ala Leu 
50 980 985 990 

Glu Val Gly Leu Val Gly Leu Gly Ala Ser Tyr Leu Leu Leu Cys Thr 
995 1000 1005 

55 Ala Leu His Leu Pro Ser Ser Leu Phe Leu Leu Leu Ala Gin Gly 
1010 1015 1020 
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Thr Ala Leu Gly Ala Val Leu Gly Leu Ser Trp Arg Arg Gly Leu 
1025 1030 1035 

Met Gly Val Pro Leu Gly Leu Gly Ala Ala Trp Leu Leu Ala Trp 
5 1040 1045 1050 

Pro Gly Leu Ala Leu Pro Leu Val Ala Met Ala Ala Gly Gly Arg 
1055 1060 1065 

10 Trp Val Arg Gin Gin Gly Pro Arg Val Arg Arg Gly lie Ser Arg 
1070 1075 1080 

Leu Trp Leu Arg Val Leu Leu Arg Leu Ser Pro Met Ala Phe Arg 
1085 1090 1095 



15 



30 



45 



50 



55 



Ala Leu Gin Gly Cys Gly Ala Val Gly Asp Arg Gly Leu Phe Ala 
1100 1105 1110 



Leu Tyr Pro Lys Thr Asn Lys Asp Gly Phe Arg Ser Arg Leu Pro 
20 1115 1120 1125 

Val Pro Gly Pro Arg Arg Arg Asn Pro Arg Thr Thr Gin His Pro 
1130 1135 1140 

25 Leu Ala Leu Leu Ala Arg Val Trp Val Leu Cys Lys Gly Trp Asn 
1145 1150 1155 



Trp Arg Leu Ala Arg Ala Ser Gin Gly Leu Ala Ser His Leu Pro 
1160 1165 1170 

Pro Trp Ala lie His Thr Leu Ala Ser Trp Gly Leu Leu Arg Gly 
1175 1180 1185 



Glu Arg Pro Thr Arg lie Pro Arg Leu Leu Pro Arg Ser Gin Arg 
35 1190 1195 1200 

Gin Leu Gly Pro Pro Ala Ser His Gin Pro Leu Pro Gly Thr Leu 
1205 1210 1215 

40 Ala Gly Arg Arg Ser Arg Thr Arg Gin Ser Arg Ala Leu Pro Pro 
1220 1225 1230 



Trp Arg 
1235 



<210> 7 

<211> 4242 

<212> DNA 

<213> Homo sapiens 

<300> 
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<308> Genbank NM_004783 
<309> 2000-11-01 
5 <313> (1)..(4242) 

<400> 7 

agaatttcaa atatcaggtt caggcccctg 

ggcgttcaaa tatcggattc agtctccatc 

aatcagaaat ccggaattcg gcagctgtcg 

15 ggtcagatta ggttgtcacc ccctcccctc 

gaagggcgaa agccgaggaa gaggtggcaa 

ttggcagagc cgctggagga ccccaggcgg 

cctaccaggc caggccccac tctcagggcc 

ggccgggagc ctgaaggacc cagatgtggc 

25 gctcttctct gacctccggg aaattggcca 

ggatgtccgg aatagtgagg tggtggccat 

caatgagaaa tggcaagaca tcatcaagga 

30 

caacaccatt cagtaccggg gctgttacct 

gtattgcctg ggctcagctt ctgaccttct 

35 agagatcgca gctgtgaccc acggggcgct 

catgatccat agggatgtga aggctggaaa 

gctaggggac tttggttctg cgtccatcat 

40 

atactggatg gcacccgagg tgatcctggc 

ggacgtctgg tccttgggga taacctgcat 

45 taacatgaat gcgatgagtg ccttatacca 

gtcaggacac tggtctgagt acttccggaa 

tcaagacaga ccaacctcag aggttctcct 

50 

acccacagtc atcatggacc tgatccagag 

cctgcagtac cgcaagatga agaagatcct 

55 cgaggcccca gaggaggaag aggaggccga 

cagcctcgag agtagccact cagtgcccag 



cgtgcaccag 


tatccggggt 


tcattccccg 


60 


ccgttcagat 


attcggggtt 


cagaccccac 


120 


ccctcgacga 


gggggaggac 


tggaccgcga 


180 


caggggaggc 


ttcccgggcc 


cgcccctcag 


240 


ggggaaaggt 


ctccttgccc 


ctctccctgc 


300 


aagcggaggc 


gctggggcac 


catagtgacc 


360 


cccaggggcc 


accatgccag 


ctgggggccg 


420 


tgagctcttc 


ttcaaggatg 


acccagaaaa 


480 


tggcagcttt 


ggagccgtat 


actttgcccg 


540 


caagaagatg 


tcctacagtg 


ggaagcagtc 


600 


ggtgcggttc 


ttacagaagc 


tccggcatcc 


c c t\ 


gagggagcac 


acggcttggc 


tggtaatgga 


720 


agaagtgcac 


aagaaacccc 


ttcaggaggt 


780 


tcagggcctg 


gcatatctgc 


actcccacaa 


840 


catcctgctg 


tcagagccag 


ggttagtgaa 


900 


ggcacctgcc 


aactccttcg 


tgggcacccc 


960 


catggatgag gggcagtacg 


atggcaaagt 


1020 


cgagctggct 


gaacggaaac 


caccgctctt 


1080 


cattgcacag 


aacgaatccc 


ccgtgctcca 


1140 


ttttgtcgac 


tcctgtcttc 


agaaaatccc 


1200 


gaagcaccgc 


tttgtgctcc 


gggagcggcc 


1260 


gaccaaggat 


gccgtgcggg 


agctggacaa 


1320 


gttccaagag 


gcacccaacg 


gccctggtgc 


1380 


gccctacatg 


caccgggccg 


ggactctgac 


1440 


catgtccatc 


agcgcctcca 


gccagagcag 


1500 
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ctccgtcaac agcctagcag atgcctcaga 
agaggaggag gaggaagaag gccctgaagc 

5 

gcacacagtc acctctcaca gctccattat 
tgatgacccc taccagccag agataacccc 
10 tcccacttcc accacctctt ctgcccgccg 



tgccaccatc cgaaccgcct ccctggtcag 



tgcgctgcgg gagcagctga gcggctataa 

15 

gctggccctg gagtcacggc tgaggggtga 



ggagcttgag gcgcagcggg ctggctttgg 



20 ccaggccata ggtgagaagg aggcacgagc 



gcacatcctt gggcagcaga agaaggagct 



ctacaaactt cgcaaggaac agctgaagga 

25 

gcgggagaag gccgagtggc tgctgcggca 



ggaggaagca gggctgctgc ggcggcagcg 



30 caagcgcaag atgttgctgg ctcggcacag 



gaacaagaag cagacccaga aggacttgga 



cacgcgggag ctggagctgc ggcagctcca 

35 

ccgcctgcag caccagacgg agctgggcaa 



agagttgcgg cagaagcatg cggcccaggt 



40 ggagctgcag atcaagaagc agttccagga 
ggctctgcga gcacacttgc tggagaccac 



gcggctcaag gaagagcaga cccgcaagct 

45 

catctcagag atgctcagct -cacaggcgct 



ccaggccctt cggcagcagc ttcaacagga 



50 gatcaagatc cgcacagaga gccagcacga 



cgcgctgcgg cgggcactgc tggagcagcg 



aggacgctcc gagcgaatcc gcagtctgct 

55 

cgatgcggaa agcatgaggc tgggcttctc 



caacgaggaa gaggaggagg 


aggaggagga 


1560 


ccgggagatg gccatgatgc 


aggaggggga 


1620 


ccaccggctg ccgggctctg 


acaacctata 


1680 


cagccctctc cagccgcctg 


cagccccagc 


1740 


ccgggcctac tgccgtaacc 


gagaccactt 


1800 


ccgtcagatc caggagcatg 


agcaggactc 


1860 


gcggatgcga cgacagcacc 


agaagcagct 


1920 


acgggaggag cacagtgcac 


ggctgcagcg 


1980 


ggcagaggca gaaaagctgg 


cccggcggca 


2040 


tgcccaggcc gaggagcgga 


agttccagca 


2100 


ggctgccctg ctggaggcac 


agaagcggac 


2160 


ggagctccag gagaacccca 


gcactcccaa 


2220 


gaaggagcag ctccagcagt 


gccaggcgga 


2280 


ccagtacttt gagctgcagt 


gtcgccagta 


2340 


cctggaccag gacctgctgc gggaggacct 


2400 


gtgtgcactg ctgcttcggc 


agcacgaggc 


2460 


ggccgtgcag cgcacgcggg 


ctgagctcac 


2520 


ccagctggag tacaacaagc 


ggcgtgagca 


2580 


tcgccagcag cccaagagcc 


tcaaatctaa 


2640 


gacgtgtaag atccagactc 


ggcagtacaa 


2700 


gcccaaagct cagcacaaga 


gcctccttaa 


2760 


ggcgatcttg gcggagcagt 


atgaccagtc 


2820 


gcggcttgat gagacccagg 


aggcagagtt 


2880 


gctggagctg ctcaacgctt 


accagagcaa 


2940 


gagggagctg cgggagctgg 


agcagagggt 


3000 


ggtggaagag gagctgctgg 


ccctgcagac 


3060 


tgagcggcag gcccgtgaga 


tcgaggcctt 


3120 


cagcatggct ctggggggca 


tcccggctga 


3180 
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10 



20 



30 



agccyctgcc 


cagggc tatc 




CCCtyCCCCd 


gcctggccct 


cccgtcccgt 


*a /i n 
3240 


LLLtCyL LLL 


y y yy ca uaL l. 


yy ctyt-c^uyy 


LLLLLCCCCa 


ccaggcatgc 


cccct ccagc 


*a *j n n 


CCyyCy CCag 


CCytCCCCyC 


f- r*i ^ 

tyyccccccc 


aggcccccca 


aactggctgg 


ggccccccac 


3360 


acaaagtggg 


acaccccgtg 


gcggagccct 


gctgctgcta 


agaaacagcc 


cccagcccct 


3420 


gcggcgggca 


gcctcggggg 


gcagtggcag 


tgagaatgtg 


ggcccccctg 


ctgccgcggt 


3480 


gcccgggccc 


ctgagccgca 


gcaccagtgt 


cgcttcccac 


atcctcaatg 


gttcttccca 


3540 


ct cctac ccc 


tgaggtgcag 


c 9999aggag 


cagatgagct 


gggcagggca 


ggggtgggtg 


3600 


gagcctgacc 


ctggagggca 


ctgagctgga 


ggcccctgca 


agggtagggg 


acaagatgta 


3 660 


ggct ccagct 


cccctcagac 


ctcctcatct 


catgagcttc 


ttggggctgg 


ccagtggccc 


3 720 


agggccagct 


tggcgataga 


tgcctcaagg 


ctgcctggga 


gccccgcctc 


cctaccatgg 


3780 


tgccaggggt 


ctccctccgc 


cacctaggaa 


aggagggaga 


tgtgcgtgtc 


aaatat teat 


3840 


ctagtcccct 


gggggagggg 


aagggtgggt 


ctagacatac 


tatattcaga 


gaactatact 


3900 


accctcacag 


tgaggccctc 


agacctgcca 


cagggcagag 


caggtctggg 


gcctgaggca 


3960 


gggagaatga 


gaggccacct 


tactggcagg aaggatcagg 


atggggtctt 


ggggtcagga 


4020 


tgcctgggtc 


tcttcccgta 


actgtctgac 


gtcctgtgcc 


gtcttgtcct 


ttatcttttt 


4080 


tttttttttt 


taattgggat 


cagggctggg gcggggaaac 


aagggaagga 


ccttggaagg 


4140 


ggctgctccc 


aggcctgggg 


ggcagtcgtg ggagcccctc 


tcagctgtgg 


ggctggcaca 


4200 


gagccccagg 


caagctttta 


ataaactgtt ggttattcta 


ac 




4242 



<210> 8 
40 <211> 1049 
<212> PRT 
<213> Homo sapiens 

45 

<300> 

<308> Genbank NP_004774.1 

50 

<309> 2000-11-01 
<313> (1) . . (1049) 

55 



<400> 



8 
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Met Pro Ala Gly Gly Arg Ala Gly Ser Leu Lys Asp Pro Asp Val Ala 
15 10 15 

Glu Leu Phe Phe Lys Asp Asp Pro Glu Lys Leu Phe Ser Asp Leu Arg 
5 20 25 30 

Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 
35 40 45 

10 Arg Asn Ser Glu Val Val Ala He Lys Lys Met Ser Tyr Ser Gly Lys 
50 55 60 

Gin Ser Asn Glu Lys Trp Gin Asp He He Lys Glu Val Arg Phe Leu 
65 70 75 80 

15 

Gin Lys Leu Arg His Pro Asn Thr He Gin Tyr Arg Gly Cys Tyr Leu 
85 90 95 

Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala 
20 100 105 110 

Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He 
115 120 125 

25 Ala Ala Val Thr His Gly Ala Leu Gin Gly Leu Ala Tyr Leu His Ser 
130 135 140 

His Asn Met He His Arg Asp Val Lys Ala Gly Asn He Leu Leu Ser 
145 150 155 160 

30 

Glu Pro Gly Leu Val Lys Leu Gly Asp Phe Gly Ser Ala Ser He Met 
165 170 175 

Ala Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
35 180 185 190 

Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp Val 
195 200 205 

40 Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu Arg Lys ' Pro Pro 
210 215 220 

Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn 
225 230 235 240 

45 

Glu Ser Pro Val Leu Gin Ser Gly His Trp Ser Glu Tyr Phe Arg Asn 
245 250 255 

Phe Val Asp Ser Cys Leu Gin Lys He Pro Gin Asp Arg Pro Thr Ser 
50 260 265 270 

Glu Val Leu Leu Lys His Arg Phe Val Leu Arg Glu Arg Pro Pro Thr 
275 280 285 

55 Val He Met Asp Leu He Gin Arg Thr Lys Asp Ala Val Arg Glu Leu 
290 295 . 300 
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Asp Asn Leu Gin Tyr Arg Lys Met Lys Lys lie Leu Phe Gin Glu Ala 
305 310 315 320 

Pro Asn Gly Pro Gly Ala Glu Ala Pro Glu Glu Glu Glu Glu Ala Glu 
325 330 335 

Pro Tyr Met His Arg Ala Gly Thr Leu Thr Ser Leu Glu Ser Ser His 
340 . 345 350 

Ser Val Pro Ser Met Ser He Ser Ala Ser Ser Gin Ser Ser Ser Val 
355 360 365 

Asn Ser Leu Ala Asp Ala Ser Asp Asn Glu Glu Glu Glu Glu Glu Glu 
370 375 380 

Glu Glu Glu Glu Glu Glu Glu Glu Gly Pro Glu Ala Arg Glu Met Ala 
385 390 395 400 

Met Met Gin Glu Gly Glu His Thr Val Thr Ser His Ser Ser He He 
405 410 415 

His Arg Leu Pro Gly Ser Asp Asn Leu Tyr Asp Asp Pro Tyr Gin Pro 
420 425 430 

Glu He Thr Pro Ser Pro Leu Gin Pro Pro Ala Ala Pro Ala Pro Thr 
435 440 445 

Ser Thr Thr Ser Ser Ala Arg Arg Arg Ala Tyr Cys Arg Asn Arg Asp 
450 455 460 

His Phe Ala Thr He Arg Thr Ala Ser Leu Val Ser Arg Gin He Gin 
465 470 475 480 

Glu His Glu Gin Asp Ser Ala Leu Arg Glu Gin Leu Ser Gly Tyr Lys 
485 490 495 

Arg Met Arg Arg Gin His Gin Lys Gin Leu Leu Ala Leu Glu Ser Arg 
500 505 510 

Leu Arg Gly Glu Arg Glu Glu His Ser Ala Arg Leu Gin Arg Glu Leu 
515 520 525 

Glu Ala Gin Arg Ala Gly Phe Gly Ala Glu Ala Glu Lys Leu Ala Arg 
530 535 540 

Arg His Gin Ala He Gly Glu Lys Glu Ala Arg Ala Ala Gin Ala Glu 
545 550 555 560 

Glu Arg Lys Phe Gin Gin His He Leu Gly Gin Gin Lys Lys Glu Leu 
565 570 575 

Ala Ala Leu Leu Glu Ala Gin Lys Arg Thr Tyr Lys Leu Arg Lys Glu 
58 0 585 590 

Gin Leu Lys Glu Glu Leu Gin Glu Asn Pro Ser Thr Pro Lys Arg Glu 
595 600 605 
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Lys Ala Glu Trp Leu Leu Arg Gin Lys Glu Gin Leu Gin Gin Cys Gin 
610 615 620 

Ala Glu Glu Glu Ala Gly Leu Leu Arg Arg Gin Arg Gin Tyr Phe Glu 
625 630 635 640 

Leu Gin Cys Arg Gin Tyr Lys Arg Lys Met Leu Leu Ala Arg His Ser 
645 650 655 

Leu Asp Gin Asp Leu Leu Arg Glu Asp Leu Asn Lys Lys Gin Thr Gin 
660 665 670 

Lys Asp Leu Glu Cys Ala Leu Leu Leu Arg Gin His Glu Ala Thr Arg 
675 680 685 

Glu Leu Glu Leu Arg Gin Leu Gin Ala Val Gin Arg Thr Arg Ala Glu 
690 695 700 

Leu Thr Arg Leu Gin His Gin Thr Glu Leu Gly Asn Gin Leu Glu Tyr 
705 710 715 720 

Asn Lys Arg Arg Glu Gin Glu Leu Arg Gin Lys His Ala Ala Gin Val 
725 730 735 

Arg Gin Gin Pro Lys Ser Leu Lys Ser Lys Glu Leu Gin lie Lys Lys 
740 -745 750 

Gin Phe Gin Glu Thr Cys Lys lie Gin Thr Arg Gin Tyr Lys Ala Leu 
755 760 765 

Arg Ala His Leu Leu Glu Thr Thr Pro Lys Ala Gin His Lys Ser Leu 
770 775 780 

Leu Lys Arg Leu Lys Glu Glu Gin Thr Arg Lys Leu Ala lie Leu Ala 
785 790 795 800 

Glu Gin Tyr Asp Gin Ser lie Ser Glu Met Leu Ser Ser Gin Ala Leu 
805 810 815 

Arg Leu Asp Glu Thr Gin Glu Ala Glu Phe Gin Ala Leu Arg Gin Gin 
820 825 830 

Leu Gin Gin Glu Leu Glu Leu Leu Asn Ala Tyr Gin Ser Lys lie Lys 
835 840 845 

lie Arg Thr Glu Ser Gin His Glu Arg Glu Leu Arg Glu Leu Glu Gin 
850 855 860 

Arg Val Ala Leu Arg Arg Ala Leu Leu Glu Gin Arg Val Glu Glu Glu 
865 870 875 880 

Leu Leu Ala Leu Gin Thr Gly Arg Ser Glu Arg lie Arg Ser Leu Leu 
885 890 895 

Glu Arg Gin Ala Arg Glu He Glu Ala Phe Asp Ala Glu Ser Met Arg 
900 905 910 
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Leu Gly Phe Ser Ser Met Ala Leu Gly Gly lie Pro Ala Glu Ala Ala 
915 920 925 

Ala Gin Gly Tyr Pro Ala Pro Pro Pro Ala Pro Ala Trp Pro Ser Arg 
930 935 940 

Pro Val Pro Arg Ser Gly Ala His Trp Ser His Gly Pro Pro Pro Pro 
945 950 955 960 

Gly Met Pro Pro Pro Ala Trp Arg Gin Pro Ser Leu Leu Ala Pro Pro 
965 970 975 

Gly Pro Pro Asn Trp Leu Gly Pro Pro Thr Gin Ser Gly Thr Pro Arg 
980 985 990 

Gly Gly Ala Leu Leu Leu Leu Arg Asn Ser Pro Gin Pro Leu Arg Arg 
995 1000 1005 

Ala Ala Ser Gly Gly Ser Gly Ser Glu Asn Val Gly Pro Pro Ala 
1010 1015 1020 

Ala Ala Val Pro Gly Pro Leu Ser Arg Ser Thr Ser Val Ala Ser 
1025 1030 1035 

His lie Leu Asn Gly Ser Ser His Phe Tyr Ser 
1040 1045 

<210> 9 

<211> 3285 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (3245) . . (3245) 

<223> a, t, c or g 

<220> 

<221> misc_feature 

<222> (3263) . . (3263) 

<223> a, t, c or g 



<220> 
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<221> misc_f eature 
<222> (3278) . . (3278) 
5 <223> a, t, c or g 

<220> 

10 <221> misc_feature 

<222> (72) . . (72) 

<223> a, t, c or g 



15 



20 



25 



30 



35 



40 



45 



50 



55 



<300> 

<308> EMBL AY049015 

<309> 2001-10-15 

<313> (1) . . (3285) 



<400> 9 
caacggattt 


catttcatac 


agatgaacca 


aggatcggga 


tagcagtata 


aaattagaat 


60 


caagacagct 


gnctgccaag 


caggatgcca 


tcaactaaca 


gagcaggcag 


cctgaaggac 


120 


cctgaaattg 


cagagctctt 


cttcaaagaa 


gatccagaga 


agctcttcac 


agatctcaga 


180 


gaaattggcc 


atggaagctt 


tggagcagtg 


tattttgcac 


gagatgtgcg 


taccaatgaa 


240 


gtggtggcca 


tcaagaaaat 


gtcttatagt 


ggaaagcagt 


ctactgagaa 


atggcaggat 


300 


attattaagg 


aagtcaagtt 


tctacaaaga 


ataaaacatc 


ccaacagtat 


agaatacaaa 


360 


ggctgttatt 


tacgtgaaca 


cacagcatgg 


cttgtaatgg 


aatattgttt 


aggatctgct 


420 


tcggatttac 


tagaagttca 


caaaaagcca 


ttacaagaag 


tggaaatagc 


agcaattaca 


480 


catggtgctc 


ttcagggatt 


agcctactta 


cattctcata 


ctatgattca 


tagagatatc 


540 


aaagcaggaa 


atatccttct 


gacagaacca 


ggccaggtga 


aacttgctga 


ctttggctct 


600 


gcttccatgg 


catcacctgc 


caattccttt 


gtgggaacgc 


cgtattggat 


ggccccagaa 


660 


gtaattttag 


ccatggatga 


aggacaatat 


gatggcaaag 


tagatgtgtg 


gtctcttgga 


720 


ataacatgta 


ttgaactagc 


ggaaaggaag 


cctcctttat 


ttaatatgaa 


tgcaatgagt 


780 


gccttatatc 


acatagccca 


aaatgaatcc 


cctacactac 


agtctaatga 


atggtctgat 


840 


tattttcgca 


actttgtaga 


ttcttgcctc 


cagaaaatcc 


ctcaagatcg 


acctacatca 


900 


gaggaacttt 


taaagcacat 


atttgttctt 


cgggagcgcc 


ctgaaaccgt 


gttaatagat 


960 
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ctcattcaga ggacaaagga tgcagtaaga 
aagaaactcc ttttccagga ggcacataat 

5 

gaggaacaag atcatggtgt tggccggaca 
tccattccca gcatgtccat cagtgccagc 
10 gatgtctcag atgacaagag tgagctagac 
aacagttctg ttatccattt aaaaccagag 
agaacaagag catcagatcc acaatctcca 

15 

cgtaatcgag aacactttgc tactatacgg 
gaacatgagc aggactctga gcttagagaa 
20 caacatcaaa agcaactgat gactctggaa 
cgcctcagat tagacaaaga tcttgaaact 
aaacttatca agaaacacca ggctgctatg 

25 

gagaaaaaat ttcagcaaca tattcaggcc 
gagtcccaga aaagagagta taaacttcga 
30 aaccagagta cccccaaaaa agaaaaacag 
cagcatttcc aagcagaaga agaagctaac 
ctggaatgcc gtcgcttcaa gagaagaatg 

35 

cttgtcaggg aggagttaaa caaaagacag 
ctccgacagc atgaatctat gcaagaactg 
40 atgcgctgtg agttgatcag attacagcat 
aataagcgaa gagaacgaga actaagacga 
aagagtttga agtctaaaga actccaaata 

45 

caaaccagac agtacaaagc attaagaaat 
cacaaagctg ttctgaaacg gctcaaggag 
50 gagcagtatg atcacagcat taatgaaatg 
gcacaggaag cagagtgcca ggttttgaag 
aatgcgtatc agagcaaaat caagatgcaa 

55 

gagcttgaac agagggtctc cctccggagg 



gagctggaca 


atctgcagta 


tcgaaagatg 


1020 


ggaccagcag 


tagaagcaca 


ggaagaagaa 


1080 


ggaacagtta 


atagtgttgg 


aagtaatcaa 


1140 


agccaaagca gtagtgttaa 


cagtcttcca 


1200 


atgatggagg gagaccacac 


agtgatgtct 


1260 


gaagaaaatt 


acagagaaga 


gggagatcct 


1320 


ccccaagtat 


ctcgtcacaa 


atcacactat 


1380 


acagcatcac 


tggttacgag 


gcaaatgcaa 


1440 


caaatgtctg gctataagcg 


aatgaggcga 


1500 


aacaagctaa 


aggctgagat 


ggatgaacat 


1560 


cagcgtaaca 


attttgctgc 


agaaatggag 


1620 


gagaaagagg 


ctaaagtgat 


gtccaatgaa 


1680 


caacagaaga 


aagaactgaa 


tagttttctc 


1740 


aaagagcagc 


ttaaagagga 


gctaaatgaa 


1800 


gagtggcttt 


caaagcagaa 


ggagaatata 


1860 


cttcttcgac 


gtcaaagaca 


atacctagag 


1920 


ttacttgggc 


gtcataactt 


agagcaggac 


1980 


actcagaagg 


acttagagca 


tgccatgcta 


2040 


gagttccgcc 


acctcaacac 


aattcagaag 


2100 


caaactgagc 


tcactaacca 


gctggaatat 


2160 


aagcatgtca. 


tggaagttcg 


acaacagcct 


2220 


aaaaagcagt 


ttcaggatac 


ctgcaaaatc 


2280 


cacctgctgg 


agactacacc 


aaagagtgag 


2340 


gaacagaccc 


ggaaattagc 


tatcttggct 


2400 


ctctccacac 


aagccctgcg 


tttggatgaa 


2460 


atgcagctgc 


agcaggaact 


ggagctgttg 


2520 


gctgaggcac 


aacatgatcg 


agagcttcgc 


2580 


gcactcttag 


aacaaaagat 


tgaagaagag 


2640 
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atgttggctt tgcagaatga gcgcacagaa cgaatacgaa gcctgttgga acgtcaagcc 2700 

agagagattg aagcttttga ctctgaaagc atgagactag gttttagtaa tatggtgctt 2760 

tctaatctct cccctgaggc attcagccac agctacccgg gagcttctgg ttggtcacac 2820 

aaccctactg ggggtccagg acctcactgg ggtcatccca tgggtggccc accacaagct 2880 

tggggccatc caatgcaagg tggaccccag ccatggggtc acccttcagg gccaatgcaa 2 94 0 

ggggtacctc gaggtagcag tatgggagtc cgcaatagcc cccaagctct gaggcggaca 3 000 

gcttctgggg gacggacgga gcagggcatg agcagaagca cgagtgtcac ttcacaaata 3060 

tccaatgggt cacacatgtc ttatacataa cttaataatt gagagtggca attccgctgg 3120 

agctgtctgc caaaagaaac tgcctacaga catcatcaca gcagcctcct cacttgggta 3180 

ctacagtgtg gaagctgagt gcatatggta tattttattc atttttgtaa agcgttctgt 3240 

tttgngttta ctaattggga tgncatagta cttggctncc cgggt 3285 

<210> 10 

<211> 1005 

<212> PRT 

<213> Homo sapiens 

<300> 

<3 08> SpTrEMBL 

<309> 2000-10-01 

<313> (1) . . (1005) 

<400> 10 

Leu Leu Ser Arg Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp 
1 5 10 15 

Pro Glu lie Ala Glu Leu Phe Phe Lys Glu Asp Pro Glu Lys Leu Phe 
20 25 30 

Thr Asp Leu Arg Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe 
35 40 45 

Ala Arg Asp Val Arg Thr Asn Glu Val Val Ala He Lys Lys Met Ser 
50 55 60 

Tyr Ser Gly Lys Gin Ser Thr Glu Lys Trp Gin Asp He He Lys Glu 
65 70 75 80 
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val Lys Phe Leu Gin Arg lie Lys His Pro Asn Ser lie Glu Tyr Lys 
85 90 95 

5 Gly Cys Tyr Leu Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys 
100 105 110 

Leu Gly Ser Ala Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin 
115 120 125 

10 

Glu Val Glu lie Ala Ala lie Thr His Gly Ala Leu Gin Gly Leu Ala 
130 135 140 

Tyr Leu His Ser His Thr Met lie His Arg Asp lie Lys Ala Gly Asn 
15 145 • 150 155 160 

He Leu Leu Thr Glu Pro Gly Gin Val Lys Leu Ala Asp Phe Gly Ser 
165 170 175 

20 Ala Ser Met Ala Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp 
180 185 190 

Met Ala Pro Glu Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly 
195 200 205 

25 

Lys Val Asp Val Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu 
210 215 220 

Arg Lys Pro Pro Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His 
30 225 230 235 240 

He Ala Gin Asn Glu Ser Pro Thr Leu Gin Ser Asn Glu Trp Ser Asp 
245 250 255 

35 Tyr Phe Arg Asn Phe Val Asp Ser Cys Leu Gin Lys He Pro Gin Asp 
260 265 270 

Arg Pro Thr Ser Glu Glu Leu Leu Lys His He Phe Val Leu Arg Glu 
275 280 285 

Arg Pro Glu Thr Val Leu He Asp Leu He Gin Arg Thr Lys Asp Ala 
290 295 300 



40 



Val Arg Glu Leu Asp Asn Leu Gin Tyr Arg Lys Met Lys Lys Leu Leu 
45 305 310 315 320 

Phe Gin Glu Ala His Asn Gly Pro Ala Val Glu Ala Gin Glu Glu Glu 
325 330 335 

50 Glu Glu Gin Asp His Gly Val Gly Arg Thr Gly Thr Val Asn Ser Val 
340 345 350 

Gly Ser Asn Gin Ser He Pro Ser Met Ser He Ser Ala Ser Ser Gin 
355 360 365 

55 

Ser Ser Ser Val Asn Ser Leu Pro Asp Val Ser Asp Asp Lys Ser Glu 
370 375 380 



> 
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Leu Asp Met Met Glu Gly Asp His Thr Val Met Ser Asn Ser Ser Val 
385 390 395 400 

5 lie His Leu Lys Pro Glu Glu Glu Asn Tyr Arg Glu Glu Gly Asp Pro 

405 410 415 

Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro Gin Val Ser Arg His 
420 425 430 

10 

Lys Ser His Tyr Arg Asn Arg Glu His Phe Ala Thr lie Arg Thr Ala 
435 440 445 

Ser Leu Val Thr Arg Gin Met Gin Glu His Glu Gin Asp Ser Glu Leu 
15 450 455 460 

Arg Glu Gin Met Ser Gly Tyr Lys Arg Met Arg Arg Gin His Gin Lys 
465 470 475 480 

20 Gin Leu Met Thr Leu Glu Asn Lys Leu Lys Ala Glu Met Asp Glu His 

485 490 495 

Arg Leu Arg Leu Asp Lys Asp Leu Glu Thr Gin Arg Asn Asn Phe Ala 
500 505 510 

25 

Ala Glu Met Glu Lys Leu He Lys Lys His Gin Ala Ala Met Glu Lys 
515 ' 520 525 

Glu Ala Lys Val Met Ser Asn Glu Glu Lys Lys Phe Gin Gin His He 
30 530 535 540 

Gin Ala Gin Gin Lys Lys Glu Leu Asn Ser Phe Leu Glu Ser Gin Lys 
545 550 555 560 

35 Arg Glu Tyr Lys Leu Arg Lys Glu Gin Leu Lys Glu Glu Leu Asn Glu 

565 570 575 

Asn Gin Ser Thr Pro Lys Lys Glu Lys Gin Glu Trp Leu Ser Lys Gin 
580 585 590 

40 

Lys Glu Asn He Gin His Phe Gin Ala Glu Glu Glu Ala Asn Leu Leu 
595 600 605 

Arg Arg Gin Arg Gin Tyr Leu Glu Leu Glu Cys Arg Arg Phe Lys Arg 
45 610 615 620 

Arg Met Leu Leu Gly Arg His Asn Leu Glu Gin Asp Leu Val Arg Glu 
625 630 635 640 

50 Glu Leu Asn Lys Arg Gin Thr Gin Lys Asp Leu Glu His Ala Met Leu 

645 650 655 

Leu Arg Gin His Glu Ser Met Gin Glu Leu Glu Phe Arg His Leu Asn 
660 665 670 

55 

Thr He Gin Lys Met Arg Cys Glu Leu He Arg Leu Gin His Gin Thr 
675 680 685 



WO 2004/056982 PCT/EP2003/0 14674 

76 



Glu Leu Thr Asn Gin Leu Glu Tyr Asn Lys Arg Arg Glu Arg Glu Leu 
690 695 700 

5 Arg Arg Lys His Val Met Glu Val Arg Gin Gin Pro Lys Ser Leu Lys 
705 710 715 720 

Ser Lys Glu Leu Gin lie Lys Lys Gin Phe Gin Asp Thr Cys Lys lie 
725 730 735 

10 

Gin Thr Arg Gin Tyr Lys Ala Leu Arg Asn His Leu Leu Glu Thr Thr 
740 745 750 

Pro Lys Ser Glu His Lys Ala Val Leu Lys Arg Leu Lys Glu Glu Gin 
15 755 760 765 

Thr Arg Lys Leu Ala lie Leu Ala Glu Gin Tyr Asp His Ser lie Asn 
770 775 780 

20 Glu Met Leu Ser Thr Gin Ala Leu Arg Leu Asp Glu Ala Gin Glu Ala 
785 790 795 800 

Glu Cys Gin Val Leu Lys Met Gin Leu Gin Gin Glu Leu Glu Leu Leu 
805 810 815 

25 

Asn Ala Tyr Gin Ser Lys lie Lys Met Gin Ala Glu Ala Gin His Asp 
820 825 830 

Arg Glu Leu Arg Glu Leu Glu Gin Arg Val Ser Leu Arg Arg Ala Leu 
30 835 840 845 

Leu Glu Gin Lys lie Glu Glu Glu Met Leu Ala Leu Gin Asn Glu Arg 
850 855 860 

35 Thr Glu Arg lie Arg Ser Leu Leu Glu Arg Gin Ala Arg Glu lie Glu 
865 870 875 880 

Ala Phe Asp Ser Glu Ser Met Arg Leu Gly Phe Ser Asn Met Val Leu 
885 890 895 

40 

Ser Asn Leu Ser Pro Glu Ala Phe Ser His Ser Tyr Pro Gly Ala Ser 
900 905 910 

Gly Trp Ser His Asn Pro Thr Gly Gly Pro Gly Pro His Trp Gly His 
45 915 920 925 

Pro Met Gly Gly Pro Pro Gin Ala Trp Gly His Pro Met Gin Gly Gly 
930 935 940 

50 Pro Gin Pro Trp Gly His Pro Ser Gly Pro Met Gin Gly Val Pro Arg 
945 950 955 960 

Gly Ser Ser Met Gly Val Arg Asn Ser Pro Gin Ala Leu Arg Arg Thr 
965 970 975 

55 

Ala Ser Gly Gly Arg Thr Glu Gin Gly Met Ser Arg Ser Thr Ser Val 
980 985 990 
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Thr Ser Gin lie Ser Asn Gly Ser His Met Ser Tyr Thr 
995 1000 1005 



<210> 11 

<211> 1576 

10 <212> DNA 

<213> artificial sequence 

15 <220> 

<223> RNAi fragment T17E9.1a (kin-18) 

<400> 11 

20 



25 



30 



35 



40 



45 



50 



55 



cgaaaaccag 


cagaagagcg 


aatgtcagct 


gaagaatgct 


ttcgacatcc 


attcattcaa 


60 


cggtctcgcc 


catcagacac 


aattcaggaa 


ctcattcaga 


gaacgaaaaa 


tatggtatta 


120 


gagttggata 


attttcaata 


caaaaagatg 


agaaaactca 


tgtatttgga 


tgaaacagaa 


180 


ggaaaagaag 


gaagtgaagg 


aaatggagca 


tctgatgatt 


tagattttca 


tggaaatgaa 


240 


gctaattcaa 


ttggaagagg 


tagtttttaa 


aattcaaagt 


gaaaatatta 


atatcttgga 


300 


ataattttta 


taatattgct 


ttaaaccctc 


agcttttttt 


tgcagactct 


atcccttagt 


360 


tgttcgtttt 


ccatctattc 


tcgttttcag 


caggagattc 


tgcgtcatct 


cgaagtgctt 


420 


ctcttacttc 


tttccgatca 


atgcagagta 


gtggaggagc 


tggtctttta 


gtgtccacca 


480 


atacgacggg 


tgctatggat 


aatgtgcatg 


gtactgtact 


gttttttttg 


ttttaggaat 


540 


ggctttatta 


tttcctgcaa 


agttcaaaaa 


ttccatttat 


tttagttttt 


ctctcgaaat 


600 


tcatcgcgca 


acattgagaa 


tctttcaaaa 


ttttcaggat 


cctctggata 


cggtaatgga 


660 


agtagttcga 


cgacgagctc 


cgcacgccgc 


cgtcctccaa 


ttccttcgca 


aatgctctct 


720 


tctacatcaa 


cgtctggtgt 


tggaactatg 


ccgagtcatg 


gatcagttgg 


agcatcgatt 


780 


acggcgatcg 


cagtcaatcc 


aacaccgtct 


ccttcagaac 


ctatcccaac 


atcacaacca 


840 


acatcgaaat 


cagaatcatc 


ttctatactc 


gaaactgcac 


acgatgatcc 


tttggacacg 


900 


tcgatacgtg 


ctccagtgaa 


agacttgcat 


atgccgcatc 


gagcagtcaa 


ggaacgaata 


960 


gccacgttgc 


aaaatcacaa 


attcgcgacg 


cttcgttccc 


agagaataat 


caatcaggaa 


102O 


caagaagaat 


atacgaaaga 


gaacaatatg 


tatgagcaaa 


tgagcaagta 


caagcatcta 


1080 


cgacaagcac 


atcacaaaga 


gctccaacaa 


tttgaagaac 


gatgtgcatt 


agatagagag 


1140 



10 
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caactgcgtg tgaaaatgga tcgagaactc gaacaattga caacgacata ctcgaaagaa 1200 

aagatgagag tgaggtgttc acagaataat gaactagaca aacggaaaaa agatatcgaa 1260 

gatggggaga aaaagatgaa aaagacgaaa aatagtcaaa atcagcagca gatgaaactg 1320 

tattcagcgc aacaattgaa agaatacaag tataacaagg aggcacagaa aacagtgaga 1380 

attcactttt atttgatttc tgtaaagaaa ttatacgaaa tttagacttt ataaattttt 1440 

aaatatgaaa gttctggtca ctttttcagc tgcttctcca cttttttcaa agtttattat 1500 

ttagtcttga ataatttttt aaaaaatgtc ctaaaccaag aattttcagc gattacgaag 1560 

15 tctgaacatg cctcga 1576 

<210> 12 

20 <211> 20 

<212> DNA 

<213> artificial sequence 

25 

<220> 

<223> primer 



30 



40 



45 



50 



<400> 12 

cgaaaaccag cagaagagcg 20 



35 <210> 13 

<211> 21 

<212> DNA 

<213> artificial sequence 



<220> 

<223> primer 
<400> 13 

tcgaggcatg ttcagacttc g 21 



